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Express Mail Mailing Label No.: EV192307027US 

Attorney Docket No.: RBN-003PR 

Methods for Identifying Genes Involved in Glucose Metabolism 

Government Support 

This invention was made with government support under grant number R01 CA79907 
from the National Institutes of Health. The United States Government has certain rights in this 
invention. 

Field of the Invention 

The invention provides methods and compositions for identifying and characterizing 
functionally related gene products associated with glucose metabolism, including pancreatic beta 
cell and adipocyte function. The invention also provides methods and compositions for 
identifying and characterizing therapeutic targets and therapeutics for treating diseases associated 
with altered glucose metabolism and associated cellular dysfunction. 

Background of the Invention 

■ 

Glucose homeostasis is regulated by the coordinated expression of a number of proteins 
that participate in insulin production and secretion. The pancreatic beta cells of the pancreas 
sense increased plasma glucose concentrations in the blood and activate a cascade of intracellular 
reactions leading to the controlled release of insulin from storage granules. Insulin, in turn, 
controls plasma glucose levels by stimulating glucose-uptake into insulin-sensitive tissues (e.g., 
skeletal muscle and adipose) and inhibiting hepatic glucose production. Diabetes is a disease 
characterized by an impairment of insulin control. Type 1 diabetes results from an inability of 
the pancreatic beta cell to produce insulin, forcing patients to take daily insulin injections to 
control their blood glucose. Type 2 diabetes is a metabolic disorder in which a patient becomes 
resistant to insulin's actions, leading to hyperglycemia and hyperinsulinemia. In many cases, 
Type 2 diabetes is associated with obesity and a sedentary lifestyle. Efforts have been made to 
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establish pancreatic beta cell lines from adult and embryonic stem cells and to engineer 
pancreatic beta cell-like cell lines in order to study the metabolic pathways that are activated 
during development, growth, and maintenance of pancreatic beta cells. 

Although some of the cellular pathways involved in glucose metabolism are understood, 
a number of regulatory aspects of those pathways have not been fully characterized. The 
identification of mRNAs that are co-regulated with insulin gene expression would provide 
informationabout the regulation of genes involved in controlling insulin production and 
secretion by beta cells of the pancreas. Identification of co-expressed mRNAs would also help 
identify previously unknown components of the insulin signaling and metabolic pathways in 
adipocytes and better define those pathways, as well as to provide therapeutic targets for 
diabetes, obesity, and other diseases characterized by altered glucose metabolism. 

Summary of the Invention 

Methods and compositions of the invention are used to isolate proteins and nucleic acids 
that participate in functionally-related pathways relating to glucose metabolism. In particular, 
methods and compositions of the invention are used to isolate RNA binding proteins and subsets 
of mRNAs that participate in insulin production and secretion. The invention is based oh the 
elucidation of the coordinated regulation of multiple genes at the post-transcriptional level by 
RNA binding proteins. The isolation of a pool of mRNAs associated with an RNA binding 
protein results in the identification of a functional cluster of mRNAs and related genes, the 
expression of which is coordinately regulated by the cell. The RNAs bound to a particular RNA 
binding protein may also possess common primary or secondary structures that mediate binding 
to the RNA binding protein. RNA binding proteins and mRNAs identified by methods of the 
invention are useful to elucidate glucose regulatory pathways, hi addition, RNA binding 
proteins are useful to test agents (e.g., therapeutics) for their ability to modulate those pathways 
and to regulate other genes in that participate in those pathways. 

The invention provides insight into pancreatic beta cell processes by identifying mRNAs 
and proteins that are physically associated with certain RNA binding proteins or mRNP 
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complexes associated with a phenotype of the cell, e.g., glucose metabolism. For example, the 
invention provides a ribonomic profile, and methods for identifying and characterizing a 
ribonomic profile, of the mRNAs, RNA binding proteins and mRNP complex-associated 
proteins, all of which are associated with a particular mRNP complex. . For example, co- 
regulated genes participating in glucose metabolism pathway are determined by identifying 
mRNAs associated with a particular mRNP complex known to be a participant in the pathway. 
According to the invention, mRNAs or proteins are classified into biologically relevant subsets 
on the basis of structural and/or functional relationships (e.g., that participate in the same insulin 
production or secretion pathway, or that facilitate gene expression during growth and 
development in normal and diseased pancreatic beta cells). In contrast to the static genomics and 
proteomics approaches to gene characterization and drug discovery, this "ribonomics" approach 
provides a dynamic snapshot of the flow of genetic information at a given time in the life of a 
cell or tissue, for example, involved in-glucose metabolism, in a normal or diseased state or in 
response to an environmental influence such as glucose or a drug. 

* 

* 

In one aspect, the invention provides methods for identifying comparing differential 
expression of an RNA binding protein and/or an mRNP complex-associated protein in sample 
obtained from diseased and healthy individuals. According to the invention, therapeut 
identified based upon targets identified. 



es 
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In an embodiment, the RNA binding protein is a polypyrimidine tract binding (PTB) 
protein or another RNA binding protein that is associated with glucose metabolism. Methods of 
the invention may further comprise the step of identifying as a therapeutic target an mRNA 
encoding the RNA binding protein and/or an mRNP complex-associated protein, a gene 
encoding the RNA binding protein and/or an mRNP complex-associated protein, an mRNP 
complex comprising the RNA binding protein and/or an mRNP complex-associated protein, an 
mRNA associated with the mRNP complex, and a gene encoding the mRNA associated with the 
mRNP complex. In an embodiment, gene expression from a differentiated cell, such as an 
adipocyte or pancreatic beta cell, is compared to that of an undifferentiated cell, such as a 
preadipocyte. In another embodiment, the cell sample is treated with an agent, such as insulin, a 
beta-adrenergic agonist, insulin-like growth factor-1 (IGF-1), glucagon-like peptide-1 (GLP-1), 
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tatty acid, thiazolidinediones and/or glucose and gene expression compared to an untreated cell 
to identify differential gene expression, which is indicative of a therapeutic target. 

In another aspect, the invention provides methods for identifying a gene or gene product, 
involved in a glucose metabolism pathway in a cell by identifying at least one RNA binding . 
protein or mRNP complex-associated protein that is differentially expressed in a normal versus 
diseased cell (e.g., a cell exhibiting an aberrant phenotype relating to glucose metabolism), 
isolating an mRNP complex comprising the RNA binding protein or mRNP complex-associated 
protein and at least one mRNA, and identifying a component of the complex as being involved in 
a biochemical pathway. In an embodiment, the RNA binding protein is a polypyrimidine tract 
binding (PTB) protein. In an embodiment, a method of the invention further comprises the step 
of contacting the cell with an agent that interacts with a component of the biochemical pathway. 
The agent may inhibit or enhance the biochemical pathway. The pathway may be, for example, 
an insulin production pathway or a lipogenesis pathway. In an embodiment, the cells are 
preadipocytes, adipocytes or pancreatic beta cells. 

In another aspect, the invention provides methods for identifying an agent that interacts 
with or regulates a component of a biochemical pathway involved in glucose metabolism by 
contacting a sample With an agent, preparing a ribonomic profile of the agent-treated sample, and 
comparing the level of expression of the component in the agent-treated sample to the level of 
expression of the component in a control sample, wherein a difference in the expression of the 
component is indicative that the agent is capable of interacting with or regulating the component. 
The ribonomic profile comprises expression of at least one component of the biochemical 
pathway associated with at least one mRNP complex. The component may be an RNA binding 
protein, an mRNA, and/or an mRNP complex-associated protein. 

In another aspect, the invention provides methods for assessing the efficacy of an agent 
as a therapeutic in treating an individual having a disease associated with altered glucose 
metabolism. Methods comprise the steps of contacting a sample from an individual having a 
disease with an agent, preparing a ribonomic profile of the agent-treated sample and comparing 
the level of expression an RNA binding protein or mRNP complex-associated protein in the 
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Brief Description of the Drawings 

Figure 1 A is a schematic outlining a strategy for the isolation of an RNA binding protein, 
RBPl, using a RiboTrap™ assay, followed by the isolation and identification of mRNAs 
associated with PTB, using the RAS™ assay. 

Figure IB is a schematic outline a strategy for the isolation of a 5' untranslated region- 
binding RNA binding protein and a 3 • untranslated region-binding RNA binding protein using a 
RiboTrap™ assay, followed by the isolation and identification of mRNAs associated with PTB 
using the RAS™ assay. 

Figure 2 is a schematic for the stimulus-coupling mechanism of insulin secretion in 
pancreatic beta cells. 

Figure 3 is a schematic overview of the data flow for analyzing and interpreting 
microarray results from comparative RNA binding protein expression and/or mRNP complexes. 

• Figure 4 is a schematic overview of the RiboTrap™ assay and the Ribonomic Analysis 
System (RAS™) assay. 

Figure 5 is a schematic overview of the target discovery process using RNA binding 
proteins and mRNP complexes. \ 

• * 

Figure 6 is a schematic overview of a RiboTrap™ assay. An mRNA encoding an RNA 
binding protein (RBPl or RBP2) of interest tagged with a ligand such as MSH coat protein (CP) 
binding site (RNA stem loops) is introduced into a cell by transfection and expressed. The tag 
allows for recovery of the mRNA with its attached RNA binding protein. A binding partner for 
the ligand such as CP antibody is used to immunoprecipitate the tagged mRNA and its associated 
RNA binding proteins. 

Figure 7 is a Western blot illustrating the in vitro verification that PTB specifically binds 
the 3'UTR and not the 5'UTR of the insulin mRNA from INS-1 cell lysates. Lanes 1 and 2- no 
oligo; Lanes 3 and 4: control oligo; lanes 5 and 6: 5' UTR oligo; Lanes 7 and 8: 3' UTR oligo 



Figure 8 illustrates glucose-regulated RBP binding to preproinsulin mKNA. 
Figure 9 is a schematic overview of target discovery of primary adipocytes. 

Figure 10 is a graphical representation of expression data generated on a human Ribochip™ 
microarray comparing RNA binding protein expression of adipocytes and preadipocytes These 
graphs are output of GeneSpring comparison of data sets showing relative gene expression levels 
between two conditions or samples. 

Figure 1 1 A is a graphical representation of expression data generated on a human Ribochip™ 
microarray comparing RNA binding proteins that are down-regulated in human adipocytes 
compared to preadipocytes. 

Figure 1 IB is a graphical representation of expression data generated on a human Ribochip™ 
microarray comparing RNA binding proteins that are up-regulated in human adipocytes 
compared to preadipocytes. 

i 

Figure 12 is a list of RNA binding proteins that are increased 2-fold or more in adipocytes 
compared to preadipocytes from lean individuals. 

« 

Figure 13A is a graphical representation of expression data generated on a human Ribochip™ 
microarray comparing the effects of lOOnm insulin or luM 03 agonist BRL-37344 on RNA 
binding protein expression over basal levels in human adipocytes from lean individuals. 

Figure 13B is a graphical representation of expression data generated on a human Ribochip™ 
microarray showing the expression of RNA binding proteins that are increased or decreased 1 5- 
fold or more over basal levels by treatment of adipocytes from lean individuals with lOOnm 
insulin or luM 03 agonist BRL-37344. 

Figure 14 is a list of RNA binding proteins that are increased 2-fold or more over basal levels 
after treatment of adipocytes from lean individuals with insulin. 
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Figure IS isalist ofRNA binding proteins fc, m 2 . fold or more fa adj 
compared to preadipocytes from obese individuals. 

Figure 16A-16Kisalistof3231 mRNAs generated from RAS™ analysis of the FIB RNA 
binding protein. 

Figure 17 identifies the ion channel proteins associated with PTE that have not previously been 
associated with the glucose metabolic pathway. 

Detailed Description 

■ 

The invention provides methods for mining and characterizing the cellular ribonome to 
cetis that participate to glucnse metabolism. Specifically, the invention allows elncidation of 
cnflufar gene regdation by identifying and measnring mRNAs and protetos mat are functionaffy 
co-assocrated with mRNP complexes that participate in glncose metaboUsm. 

Methods of the invention compose identrfytog components of mRNP complexes. The 
rdenfrfied component are usetnf for diagnosing, monitoring, and assessing me metabolic or 
dtsease stete of a cel.. Components of mRNP complexes can also be nsed to identity potential 
ttrerapeutic targe* as well as for assessing the efficacy or toxicity of potential therapeutics 
Moreover th. invention provides methods for identifying and chamctertotog stiucturaHy and/or 
functionally related gene products, and to elucidate features of biological pathways. 

Generally, an mRNP complex comprises various components that may include but are 
not hnfited to, at least one RNA binding protein, at least one associated or bound mRNA and at 
least one associated or bound protein (/.,, an mRNP complex-associated protein). An mRNP 
complex may also comprise other associated or bound molecules (e. g .. carbohydrates, lipids, 
vxtamms, etc.). A component associates with an mRNP complex if it binds or otherwise attaches 
to the mRNP complex with a Kd of about lO* to about 10*. In preferred embodiment the 
component associates with the complex with a Kd of about 10" 7 to about 10" 9 In a more 

preferred embodiment, the component associates with the complex with a Kd of about lO* to 
about 10 . 
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— ■» - —I* ■» a cell, and, preferably, identifying the components of 

dre mRNP complex and me gene precursors «a ^ pmdaea of(hose a 

gene expression profile is genenrted. By identifying the mRNA components of a ceUnlar 
nbonome, me cellular transcriptome is broken down into a series of subprofijes tha, togemer are 
used to define me gene expression state of a coll or tissue. He associated or bound mRNAs are 
categonzed mto distinct subset based on their association with a particular RNA binding 
protei* mRNP complex-associated protein, or common shucmral feahue. mboBomic profiles 
drifer from cell sample to cell sample, depending on a variety of factors including, but no, 
hnuted to, me differentiation stems of the ceU, me species or tissue type of the cell the 
developmental stege of me cel., the pathogenicity of me cell (e.g., if the cell is infected, is 
expressmg a deleterious gene, is lacking a particular gene, is no, expressing a particular gene or 
■a overexpresstng a particular gene), the specific ligands used to isolate me mRNP complexes 
me venous conditions affecting mc cel. ( , g ., environmentej, apoptotic or stress stetes, L ' 
msease or other disomer), as wel. as other factors known ,o practitioners in me art 




Messenger RNP complexes associated with glucose metabolism are identified by 
generatmg an expression pattern of RNA binding protcms in stimulu^ehaUcnged ceHs Tie 
expression profiles e Mb .e me identification of selecuvely-expressed and/or selecively- 
reguteted RNA binding proteins for further analysis. For glucose-reguteted precesses 
mRNA samples fiom isiete, MJN-6. and INS-. cel.s are subjected ,o different concenttations 
of glucose for varying periods of time. Glucose alters dre expression of RNA binding 
protems invdved in glucose-regu.a«ed fimctions in me pancreafic bete celt so ma, tissue- 

TotT Ched ° fRNA bindin6 Pm <™ - <» • Analysis of non-bete 

cell mRNA samples (o.g.. NIH3T3) provide a baseline comparison. 

P 

k™. t miCroarray - based ' custom-designed o.igo-microareay (RiboChip™ 

MWG Brotecfi (High Point, NC) centering 1400 featores representing genes, d^tem 
preducte of which are reported to have RNA binding pre.perfes or rn A binding mo6& 
Also moluded on me array are couhol features (totel of 1 7) ma. pmvide information on' 



specificity, labeling and hybridization efficiency, sensitivity and normalization between 
experiments. The mRNA samples to be analyzed are prepared from whole cells, amplified if 
necessary, and processed for microarray hybridization. Briefly, total mRNA, or if necessary 
amplified mRNA, from the cell samples is labeled using amino allyl coupling chemistries 
(CyScribe Post Labeling Kit, Amersham). Amino allyl labeling reduces the fluorescent 
labeling bias observed with direct incorporation of cy5 and cy3-labeled nucleotides. After 
generation of the single stranded amino allyl product, the probe is fluorescently labeled with 
either cy3 or cy5. If multiple probes are to be used for a single array, they are dried down 
and are resuspended in nuclease free water containing control probes (e.g., Cot- 1 and 
Arabidopsis DNAs) and hybridization buffer. After denaturing, probes are then incubated 
with a mouse RiboChip™ microarray. Hybridization is carried out routinely at 65°C for 18 
hours. The arrays are sequentially washed with 0.5x SSC/0.1% SDS and then 0.6x SSC. 
Data are gathered from the array using a GenePix 4000B scanner and associated software. 
The data are then processed further for clustering\profiling and graphical representation using 
GeneSpring. In addition to providing descriptive measures of data quality, statistical 
techniques are used to group or cluster genes based on similarities and differences in their 
expression profiles. A flowchart summarizing die data analysis is provided in Figure 3. 
These methods enable genes with unique or "interesting" expression profiles to be quickly 
identified from among the thousand genes profiled in a single experiment. In one such 
application of clustering, the expression profiles for 24 genes encoding RNA binding 
proteins from 33 normal human tissues were clustered (Pearson) and represented as a 
dendogram using GeneSpring (Silicon Genetics). Each column represents a distinct RNA 
binding protein gene and each row represents a distinct tissue sample. The data was 
normalized to the median expression value. Yellow indicates expression within 10-fold of 
the median value; blue indicates expression more than 10-fold below the median value; and 
red indicates expression more than 10-fold above the median value. This representation 
enabled groups of, and distinctions between, expression profiles to be quickly identified. 



RiboTrap 
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RiboTrap is an alternative, more directed, means for identifying RNA binding 
proteins. RiboTrap™ combines a biochemical and molecular biological approach for 
isolating an unknown RNA binding protein or set of RNA binding proteins that interact with 
an mRNA of interest This involves several different approaches including the use of 1) 
epitope-tagged binding elements as affinity reagents for in vitro isolation of RNA binding 
proteins and 2) expression of an epitope-tagged mRNA in cell culture models. Approaches 
for isolating the glucose-responsive RNA binding protein (s) in pancreatic beta cells and 
RNA binding proteins interacting with Nkx6.1 mRNA using current technologies are 
described in more detail below. RiboTrap™ is useful when it is necessary to first identify an 
RNA binding protein on a specific mRNA. RiboTrap™ methods are described in more detail 

* 

in Example 2. 

An mRNP complex is isolated from a natural biological sample such as an islet, a 
pancreatic beta cell, an adipocyte, or a preadipocyte, or a population of cells. The population of 
cells may contain a single cell type. Alternatively, the population of cells may contain a mixture 
of different cell types from either primary or secondary cultures or from a complex tissue such as 
an islet or tumor. 

Freshly prepared islets are the most physiologically relevant in vitro model system for 
examining glucose-responsiveness and endocrine pancreas functions. Preparation of islets 
and islet mRNA samples are prepared as described in Example 2. To identify RNA binding 
proteins that undergo changes in expression, mouse islets are incubated under conditions of 
low (2.8 mM) or high (16.6 mM) glucose for various periods of time (up to 24 hours). Total 
mRNA is prepared using Trizol according to standard methods. In some cases where 
samples are limiting, it may be necessary to amplify the mRNA. This is routinely 
accomplished using the RiboAmp™ kit (Arcturus). 

Islets are composed of four unique cell-types a, p, y, and 5. All RNA binding proteins 
expressed in islets can be profiled or RNA binding proteins from pancreatic beta cell line and 
an alpha cell line can be prepared. The use of cell lines is also advantageous for easier 

... 

manipulation of conditions as well as providing a culture system for future validation of 
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targets. The mouse-derived pancreatic beta cell line, MIN-6, may be used Compared to 
available pancreatic beta cell lines, MIN-6 cells respond well to glucose and have been used 
for numerous studies for glucose-stimulated insulin secretion and proinsulin biosynthesis As 
with the islets, the MIN-6 cells are exposed to low and high glucose concentrations for 
various periods of time and mRNA prepared and processed for microarray analysis on the 
mouse RiboChip™. 

The alpha cell line, a-TC1.6, may be used for comparison to islets' and MIN-6 cells 
These cells express mRNA for Nkx6.1, but do not express Nkx6. 1 protein. Incontrast, 
pancreatic beta cells express both mRNA and protein for Nkx6.1. Current evidence supports a 
role of RNA binding proteins in this restrictive expression during islet development. 

In one embodiment, the mRNP complex is isolated from a cell sample in which the 
expression of a component of an mRNP complex has been altered, induced, inhibited or 
over-expressed. In another embodiment, a particular mRNP complex or component or precursor 
for one or more components of the mRNP complex has been introduced into the sample or has 
been genetically altered. Introduction of the one or more mRNP complex components may 
occur by infection, transformation, or other similar methods known in the art In one 
embodiment, an expression vector expressing one or more components of an mRNP complex is 
transfected into the cell. Suitable vectors include, but are not limited to, recombinant vectors 
such as plasmid vectors or viral vectors. The component is preferably operatively linked to 
appropriate promoter and/or enhancer sequences for expression in the cell. In an embodiment of 
the xnvention, a specific cell type is engineered to contain a cell type-specific or inducible gene 
promoter that drives expression of an RNA binding protein. A ligand, such as an antibody that is 
specific for this RNA binding protein, may immunoprecipitate the RNA binding protein with its 
attached or associated mRNAs, from a tissue extract containing the cell type of interest. The 



research, as described herein. 



further 



Alternatively, the cell sample may contain a knock out cell line or knock out organism 
that etfher does not express a component of the mRNP complex or expresses decreased levels of 
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ftc component Preferably, the knock Q * cell one o, knock on, „ rganism does no, express a 
partrcdar RNA bindiag protein , „ ^ ^ fc ^ 

protem,oranmRNPcomplex^ SO cta tt dp rotein . r^AOmaing 
^ In a p reft ,red embodiment me nucleic acid encoding «,» mRNP complex oomponen, is 
and/or deteouon of me components. Accessible epitopes may be used or, where the epitopes on 

!ToZ pon T are r ccessib,e or obscured - ^ - — -mZ 

proems may be used. Suunble tags inclnde, bn, are no, hunted ,o, bioun, me MS2 protein 
bmdmg stte sequence, fire ffsnRNA 70k binding site sequenca, me UlsnRNA A binding site 
sequence fire g,0 binding she sequence (Novagen, Inc., Madison, WI ), and FLAG-TAG 
( tgma Chemical St Louis, MO). For example, a hansfonned col. contenting a teaasfocted 
vecter dnoch^g „e expmssion of a tegged RNA binding protei n can he mixed ^m other cell 

" £!T iBm °° im ° l " hUmm b ™ ~ « *-* -has an 

^ ' ha, b ^ «"-'■-— ^unopreeiplnue me 

uansformed ee„ The mRNP compiexes and associated RNAs can men be identified clr to 
form an expression profile for that cell type for further analysis. 

or tJTT* " f 0,16 M m ° re """ PlCX C ° mPOnen,S ta -"acdng 

orfi^ng 40 cell samp,c win, a known or tea, compound. The compound may be, bu, is no ' 

lumted m, a protem, a nucleic acid, a pepfide, an anubody, an a«ihody fragment a smaU 
molecule, or an enzyme. Where me compound is a nucleic acid, me nucleic acid may be an 

nucletc acd. In one embodiment me compound may alter fte expmssion of an mRNP clplex 
componen, fiuough compefiuve binding. A compound may inhibi, binding hefiveen an RnI 
bmdmg protein and an mRNA, berween an RNA binding protein and an mRNP complex- 
associated protein, or betwean an mRNA and an mRNP complex-associated pmtein, for 
sample, m another embodiment the col, sample is infected wim a pathogen, such as a virus, 
bactena, prton, fungus, parasite, or yeast to alter expression of one or more mRNA complexes 
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While any method for the isolation of an mRNP complex may be used in the present 
invention, the methods disclosed in co-pending U.S. Application Nos. 09/750,401; 10/238,306; 
and 10/309,788 are preferred, the entire disclosures of which are hereby incorporated by 
reference. The in vivo methods for isolating an mRNP complex involve contacting a biological 
sample that includes at least one mRNP complex with a ligand that specifically binds a 
component of the mRNP complex. For example, the ligand may be an antibody, a nucleic acid 
(e.g., an antisense, aptamer, or RNAi molecule), or any other compound or molecule that 
specifically binds the component of the complex. In certain embodiments, the ligand is obtained 
by using the serum of a subject that has a disorder known to be associated with the production of 
mRNP complex-specific antibodies or proteins. Examples of these disorders include 
autoimmune disorders and a number of cancers. In certain embodiments, the ligand is tagged 
with another compound or molecule in order to facilitate the separation, observation or detection 
of the ligand. In one embodiment of the invention, the ligand is epitope tagged, as described in 
the art. 

* 

In an embodiment, the mRNP complex is separated by binding the ligand (now bound to 
the mRNP complex) to a binding molecule that specifically binds the ligand. The binding 
molecule may bind the ligand directly {e.g., a binding partner specific for the ligand), or may 
bind the ligand indirectly (e.g., a binding partner specific for a tag on the ligand). Suitable 
binding molecules include, but are not limited to, protein A, protein G, and streptavidin. Binding 
molecules may also be obtained by using the serum of a subject suffering from a disorder such as 
an autoimmune disorder or cancer. In an embodiment, the ligand is an antibody that binds a 
component of the mRNP complex via its Fab region and a binding molecule binds the Fc region 
of the antibody. 

In an embodiment, the binding molecule is attached to a support (e.g., a solid support 
such as a bead, well, pin, plate, or column). Accordingly, the mRNP complex is attached to the 
support via the ligand and binding molecule. The mRNP complex may then be collected by 
removing it from the support (e.g., by washing or eluting it from the support using suitable 
solvents and conditions that are known to a skilled artisan). 
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In certain embodiments of the invention, the mRNP complex is stabilized by cross 
linking prior to binding the ligand thereto. Generally, cross-linking involves covalent binding 
(e.g., covalently binding the components of the mRNP complex together). Cross-linking may be 
earned out by physical means (e.g., by heat or ultraviolet radiation), or chemical means (e g. by 
contacting the complex with formaldehyde, paraformaldehyde, or other known cross-linking 
agents), methods of which are known to those skilled in the art. In other embodiments the 
ligand is cross-linked to the mRNP complex after binding to the mRNP complex. In additional 
embodiments, the binding molecule is cross-linked to the ligand after binding to the ligand In 
yet another embodiment, the binding molecule is cross-linked to the support. 

The methods of the invention allow for the isolation and characterization of a plurality of 
mRNP complexes simultaneously (e.g., «en masse"). For example, a biological sample is 
contacted with a plurality of ligands each specific for different mRNP complexes. A plurality of 
mRNP complexes from the sample bind the appropriate specific ligands. The plurality of mRNP 
complexes are then separated using appropriate binding molecules, thereby isolating the plurality 
of mRNP complexes. The mRNP complexes and the mRNAs contained within the mRNP 
complexes are then characterized and/or identified by methods described herein and known in 
the art Alternatively, the methods of the invention are carried out on a sample numerous times 
and the mRNP complexes are characterized and identified in a sequential fashion, with each 
iteration utilizing a different ligand. 

Amplification of the mRNA isolated according to the methods of the invention and/or the 
cDNA obtained from the mRNA is not necessary or required by the present invention. However 
the skilled artisan may choose to amplify the nucleic acid that is identified according to any of ' 
the numerous nucleic acid amplification methods that are well-known in the art (e.g., polymerase 
chain reaction (PCR), reverse transcriptase polymerase chain reaction (RT-PCR), quantitative 
polymerase chain reaction (QT-PCR), or strand displacement analysis (SDA)). 
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Analysis of Isolated mRNP Complexes - RAS™ Anah 
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The present invention provides methods for assessing the metabolic or gene expression 
state of a cell such as a pancreatic beta cell. Following isolation of at least one mRNP complex, 
the level of expression of at least one mRNA associated with the mRNP complex and/ or at least 
one mRNP complex-associated protein is determined. In an embodiment, the level of expression 
of the mRNA(s) or the mRNP complex-associated protein(s) on a particular mRNP complex 
provides a subprofile that is indicative of, for example, the gene expression of a subset of 
functionally related gene products relating to pancreatic beta cell function or glucose 
metabolism. In an embodiment, a subset of mRNAs associated with a particular mRNP complex 
identifies a ribonomic subprofile that is characteristic of a functional RNA network or biological 
pathway relating to glucose metabolism. The collection of mRNA subsets for any particular cell 
or tissue sample constitutes a gene expression profile, and, more specifically, a ribonomic gene 
expression profile, for that cell or tissue, It will be appreciated that ribonomic profiles may differ 
from cell to cell as described previously. Thus, the ribonomic profile of a cell can be used as an 
identifier for the cell and can be compared with profiles or subprofiles of other cells. 
Approaches for Isolation of Functional Clusters ofmBWAc 

The preferable technique for isolating functional clusters' associated with glucose 
stimulated insulin secretion and 0-cell homeostasis is in vivo RAS™, whereby affinity reagents 
specific to the RNA binding protein of interest are .used to isolate and purify the endogenous 
RNA binding protein complexed with its unique repertoire of mRNAs (a "functional cluster") If 
this technique for isolating endogenous RNA:protein complexes is not feasible, for example due 
to ineffective affinity reagents for immunoprecipitation of the in tact endogenous complex other 
complementary approaches to determine the mRNA composition for an RNA binding protein 
can be used, including: in vivo RAS™ with tagged RNA binding proteins and in vitro RAS™ 
In vitro RAS™ 

In short, RNA binding proteins were cloned by PCR and the sequence verified and 
expressed in & coli as GST fusion proteins. Following purification, the GST-RNA binding 
protein was attached to glutathione Sepharose beads and exposed to mRNA preparations to 
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assess their abilities to selectively retain discreet mRNA pools. In general, two types of 
mRNA preparations were used, cleared cytoplasmic lysates and purified cytoplasmic RNA 
Purified cytoplasmic RNA were used to directly identify mRNAs that encode cis binding 
elements for the RNA binding protein. Using aCPl that bini to and stabilizes the a-globin 
mRNA, it was recently shown that the use of cellular lysates containing both RNA and 
protein improves the specificity of the RNA binding proteinrRNA interaction likely due to 
the presence of auxiliary factors that modulate binding. Cytoplasmic lysates therefore offer 
the best source to identify mRNAs being regulated by RNA binding protein (e.g., PTB) in 
INS-1 cells. mRNA retained by the individual GST- RNA binding proteins was profiled by 
combined microarray and QPCR analyses. For additional glucose-regulated, tissue specific 
RNA binding proteins, comparisons are made between mRNA pools retained using purified 
(naked) RNA and those selected from cytoplasmic lysates. UTR sequences are aligned to 
search for obvious consensus elements in the retained mRNA pools, and a small number 
(e.g., 5-10 UTRs) are initially evaluated to confirm direct binding by biotinylated 
ohgonucleotide-affinity chromatography (as described for RiboTrap™). Over 50 RNA 
binding proteins have been identified and *, vitro RAS™ has been performed on three of 
these: PTB, ACO-1 and HNRNP-K. Each of these RNA binding proteins binds mRNA from 
both purified RNA and lysate preparations. The bound mRNA species for each RNA 
binding protein is determined by array analysis and QPCR. 

Affinity-t agging of RBPs in vivn. 

The multicomponent nature of RNPs can interfere with efficient immunoprecipitation 
due to seclusion of reactive polypeptide epitopes. As a consequence, finding appropriate 
affinity reagents and optimal conditions for immunoprecipitation of endogenous compl 
can be challenging, m the absence of appropriate affinity reagents or when endogenous 
complexes cannot be isolated, mRNAs associated with individual RNA binding proteins 
cell are identified by using RNA binding proteins tagged with one of several generic epitopes 
such as Flag, AU1, or T7. The binding epitopes are expressed on the N- or C-terminus of 
PTB and other RNA binding proteins and introduced into an appropriate cell line for 
expression. Pooled cell lines are generated by selection (e.g., zeocin) and screened for stable 
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expression of the tagged RNA binding proteins. Commercially available antibodies (e.g., o- 
17) are used to immunoprecipitate RNPs from INS-1 cells following mock or glucose 
treatment. As a positive control, tagged PABP1 (poly A binding protein), which is known to 
bind virtually all polyadenylated mRNAs, is constructed and selected into INS-1 cells for 
parallel immunoprecipitation of RNPs. Messenger RNA pools isolated following low and 
high glucose treatment of the individual INS-1 cell lines (pooled lines) are evaluated by 
microarray analysis and selective QPCR confirmation, he use of atagged-PTB : 
isdvantageous in that the functional cluster associated of the tagged-PTB can be directly 
-compared with that isolated using the commercially available monoclonal antibody to PTB 
(Zymed). This allow for validation of the endogenous PTB cluster as well as assessment of 
the mRNA binding characteristics of tagged-PTB. 

Immunoprecipitation of endogenous complexes fRAS™i 

By far, the most powerful method available to profile endogenous RNP complexes 
from cells or tissue utilizes immunoprecipitation of endogenous RNPs from cell lysates and 
characterization of mRNA content. Polyclonal anti-peptide and\or full-length protein 
antibodies, monoclonal antibodies, or recombinant antibody libraries may be used. 
Optionally, more than one peptide antigen may be chosen based on analysis of the protein 
sequence using software for antigenic determination (Antheprot; uses Parker and Wellington 
algorithms), followed by a Blast P search in NCBI to ensure that the designed peptide is not 
significantly homologous to another protein. Peptides are selected from regions thought to 
lie outside the RNA binding domain to enrich for epitopes that are more likely to be exposed 
in the RNP complex. 15-25 amino acid peptides are contracted externally for synthesis and 
conjugation to KLH followed by immunization of rabbits for polyclonal antibody production. 

Antibodies to individual RBPs are used to isolate associated mRNAs at any given 

disease state or under experimental conditions. In contrast to the tagged RBP approach 

described above, isolation of endogenous RNPs does not require perturbation of the system 

(e.g., transfection and selection of cell lines expressing tagged RBP) prior to analysis. 

However, the rate-limiting step in this analysis is acquisition of antibodies specific for 

individual RNA binding proteins which can immunoprecipitate intact endogenous RNPs. 

Conditions have been defined using a commercial antibody for PTB that effectively 
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immunoprecipitates FIB containing RNPs from INS-1 cells. It is clear that with careful 
attention to antibody selection and optimization of lysis and washing conditions, endogenous 
RNPs can be routinely isolated from cells and tissues. Once a candidate RNA binding 
protein has been chosen for RAS™, polyclonal antibodies are made using the recombinant 
RNA binding proteins prepared as described for in vitro RAS™. 

Full-length RNA binding protein genes are PCR amplified out of appropriate cDNA 
libraries and cloned into expression vectors (pGEX orpDEST17 6X-His) for bacterial 
expression purification and antibody production. Antibodies are affinity-purified, 
characterized, and optimized for immunoprecipitation of the protein and its associated RNA 
binding proteins/mRNA complex. The ability of the antibody to precipitate ribonucleic acids 
in general is determined by a rapid, high-throughput analysis using a 2100 BioAnalyzer 
(Agilent). Non-immune controls include previously characterized RNA binding protein 
antibodies run in parallel as negative and positive controls, respectively. Specific antisera that 
are able to immunoprecipitate the RNA binding proteinVmRNA complex are used for further 
analysis. Nuclei-free cytosolic extracts are prepared essentially as described from cells (or 
tissue) that have been exposed to various experimental conditions (e.g., low and high 
glucose). For immunoprecipitation of the RNA binding protein /mRNA complex, specific 
antibodies are pre-bound to protein A beads, blocked with bovine serum albumin and washed 
extensively. After a final wash in lysis buffer, cell extracts are added. Incubation times and 
temperatures are optimized for each anti- RNA binding protein. The complexes are washed . 
extensively under nuclease-free conditions. The antibody-mRNP complex is then disrupted 
with denaturing buffer RLT containing guanidine thiocyanate, and mRNA purified using 
Qiagen RNA isolation column chromatography. The purified mRNA is then processed for 
microarray analysis on contemporary human or rodent microarrays (depending on the cell or 
tissue source) comprised of features (10,000-30,000 genes) representing up-to-date genomic 
content (Agilent or M WG). Genes observed at 'detectable' levels that are present in each of 
the experiments is considered a component in that RBP's complex and its relative fold- 
enrichment above a total RNA microarray analysis is determined. The presence of the 
candidate genes and their relative fold-enrichment over total RNA are verified and more 
accurately quantified by Q-PCR using sequence-specific primers. Routinely, genes enriched 
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at a LEVEL >2-fold above total RNA sample are considered members of that cluster. The 
above combination of in vitro and cell culture based approaches to map mRNA pools 
accurately defines the mRNA content of selected RNPs. The combined use of these 
integrated techniques offers an unparalleled algorithm to define the cell's organization of 
coordinately regulated mRNAs. 

Supporting analytic and bioinformatic platform 

The mRNA samples to be analyzed are prepared from various cell and tissue-types by 
RNA extraction with RNeasy™ (Qiagen, Inc.), quantified by absorbance (A 260 ), and stored at - 
80°C until use. RNA samples are routinely evaluated using an Agilent Bioanalyzer 2100 which 
provides a rapid and quantitative measure of 28S and 18S ribosomal RNA integrity. RNA is 
labeled with or without amplification through generation of cDNA by reverse transcription in the 
presence of amino allyl-dUTP followed by direct coupling to Cy3 or Cy5 fluorescent dyes 
(CyScribe Post Labeling Kit, Amersham). If RNA quantities are limiting (< 10 ug), nucleic acid 
amplification is accomplished with reverse transcription, cDNA tailing, and PCR amplification 
procedures prior to Cy3/Cy5 coupling. If amplification of RNA from an RNA binding protein 
complex is needed, the total RNA sample is also amplified. Hybridization is carried out 
routinely at 65 °C for 18 hours. The arrays are sequentially washed with 0.5X SSC/0.1% SDS 
and then 0.6X SSC. Microarray slides are read by an Axon 4000B scanner, images acquired by 
GenePix 4.0 (Axon), and features extracted with hnagene V4.2 (Biodiscovery). Data analysis, 
including normalization (lowess), centralization, and scaling is accomplished using GeneSpring 
4.2. 1 (Silicon Genetics). Differentially expressed genes are identified using a combination of 
methods for assessing variation in expression level between conditions. These include t-tests and 
ANOVA analysis with multiple testing corrections, fold difference filtering, and clustering 
techniques. These methods enable genes with unique or "interesting" expression profiles to be 
quickly identified from among the thousand genes profiled in a single experiment. Genes within 
a cluster are chosen based upon relative enrichment of expression compared to total RNA 
samples. Routinely, genes enriched at a LEVEL >2-fold above total RNA sample are considered 
members of that cluster (e.g., PTB cluster from Rat INS-1 cells contains 3231 genes using 2-fold 
enrichment over total RNA sample as the statistical cut-off (Figure 1 6 A- 1 6K). Data was 

20 



generated using rat INS-1 cells (BetaGene, Inc.) and MWG 1 OK Rat Pan Arrays (cat# 2250- 
000000). This analysis revealed a highly enriched (>5-fold) subset of approximately 450 genes 
The normalized intensities of many of the 650 genes were altered (>2-fold) in the clusters 
isolated from cells treated with 1 5mM glucose whereas the same genes in the total RNA analysis 
were unchanged. This suggests that glucose could regulate the appearance of many mRNAs into 
or out of the cluster. In support of the functional cluster hypothesis, numerous predicted genes 
were highly enriched in the PTB-cluster and the presence of many of these was regulated by 
glucose. Included in this list are mRNAs for GIut2, glucokinase, phosphofructokinase, Kir6 2 
(the ATP-sensitive K + -channel), SUR1 (sulfonylurea receptor 1), L-type Ca2 + -charinels, acyl- 
coa carboxylase and preproinsulin. In addition, and importantly, approximately 10% of the 450 
genes in the PTB cluster had normalized intensity values at or below detectable levels when 
analyzed by normal array analysis of total mRNA samples. Thus, the ability to isolate the PTB 
cluster, purify and identify its associated mRNAs lead to the identification of very low abundant 
genes that most likely would have been missed or ignored in a normal array analysis. The ability 
to isolate the PTB cluster, enrich for a unique subset of genes, their regulated appearance in the 
cluster and identification of very low abundant genes supports the hypothesis regarding the role 
of rbps in gene/protein expression and their utility to obtain novel target and cellular pathway 
information. Expression of all candidate mRNAs in an RNP complex chosen for further 
downstream analysis are verified at the mRNA level by Q-PCR using gene specific primers. 

" Accordingly, the present invention provides diagnostic methods for assessing the cell 
types present ma sample or a population o^ or 
preadipocytes. The method involves isolating at least one mRNP complex and detecting the 
expression of at least one component of the mRNP complex, wherein the at least one component 
is specific for a certain cell type, so that the detection of the expression of the component is 
indicative of the presence of the cell type in the population of cells. The component may be 
specific for a certain cell type within an entire sample {e.g., tissue or organism) or within the ' 
population of cells,. The sample or population of cells may be, for example, a tumor, a tissue a 
cultured cell, a body fluid, an organ, a cell extract or a cell lysate. The methods of the invention 
may also be used to determine the cell types present in a population of cells. Alternatively, cell 
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type, as used herein, may also refer to a class of cells derived from a particular tissue, a particular 
species, a particular state of difTerentiation, a particular disease state, or a particular cell cycle. 

« 

An isolated mRNP complex can be examined, in part to determine expression of its 
components, as a whole, or broken into its components. The mRNP complex can be separated 
from the ligand as a whole, or the mRNA can be separated from the ligand-RNA binding protein 
complex, followed by separation of the RNA binding protein from the ligand. Alternatively, if 
the mRNA is bound to the ligand, the RNA binding protein can be separated from the ligand- 
mRNA complex, and the mRNA then separated from the ligand. Practitioners in the art are 
aware of standard methods of separating the components, including washing and chemical 
reactions. After separation, each component of an mRNP complex can be examined and their 
identity, quantity, or other identifying factors preferably recorded (e.g., in a computer database) 
for future reference. 

cDNAs can be used to identify complementary mRNAs on mRNP complexes partitioned 
according to methods disclosed herein. cDNA microarray grids can be used to identify mRNA 
subsets en masse. Microarrays are precisely aligned grids in which each target nucleic acid (e.g., 
cDNA, oligonucleotide, or gene) has a position in a matrix of carefully spotted cDNAs. Each 
target nucleic acid examined on a microarray has a precise address that can be located, and the 
binding can be quantitated. Microarrays may be arranged in a commercially available substrate 
(e.g., paper, nitrocellulose, nylon, any other type of membrane filter, chip, such as a siliconized 
chip, glass slide, silicone wafer, or any other suitable solid or flexible support). In addition, 
mRNAs in a sample can be identified based upon the stringency of binding and washing, a 
process known as "sequencing by hybridization." 

Alternative approaches for identifying, sequencing and/or otherwise characterizing the 
mRNAs in an mRNA subset include, but are not limited to, differential display, phage 
display/analysis, SAGE (Serial Analysis of Gene Expression), and preparation of cDNA libraries 
from the mRNA preparation and sequencing of the members of the library. 
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Methods for DNA sequencing that are well known and generally available in the art may 
be used to practice any of the embodiments of the invention. The sequencing methods may 
employ such enzymes as the Klenow fragment of DNA polymerase I, SEQUENASE® 
(U.S. Biochemical Corp, Cleveland, OH), Taq polymerase (Perkin Elmer, Boston, MA), 
thermostable T7 polymerase (Amersham, Chicago, IL), or combinations of polymerases and 
proofreading exonucleases such as those found in the Elongase® Amplification System marketed 
by Gibco BRL (Invitrogen™, Carlsbad, CA). Preferably, the process is automated with machines 
such as the Hamilton Micro Lab 2200 (Hamilton, Reno, NV), Peltier Thermal Cycler (PTC200) 
(MJ Research, Watertown, MA) and the ABI Catalyst and 373 and 377 DNA Sequencers (Perkin 
Elmer, Shelton, CT). 

In an embodiment, the methods of the invention are carried on isolated nuclei from cells 
(e.g., that are undergoing developmental or cell cycle changes or that have otherwise been 

* 

subjected to a cellular or an environmental change), performing nuclear run-off assays according 
to known techniques to obtain transcribing mRNAs, and then comparing the transcribing 
mRNAs with the global mRNA levels isolated from mRNP complexes from the same cells using 
cDNA microarrays. These methods thus provide methods for distinguishing transcriptional from 
post-transcriptional effects on steady state mRNA levels en masse. As opposed to a total RNA 
or transcription profile that depicts RNA accumulation representing a steady-state level of 
mRNA, which is affected by transcriptional and post-transcriptional events, the mRNAs detected 
by nuclear run-off experiments represent only the transcription of a gene before the influence of 
post-transcriptional events. The microarrays representing mRNP complexes contain discrete and 
more limited subsets of mRNAs than the transcriptome or nuclear run-offs. The mRNP complex 
microarrays are labeled mRNP-1 through mRNP-X and depict multiple mRNAs found in mRNP 
complexes isolated by using antibodies reactive with mRNP complex-associated proteins. 

Other methods for characterizing and identifying mRNP complex components include 

standard laboratory techniques such as, but not limited to, reverse transcription or quantitative 

PCR, RNAse protection, Northern Blot analysis, Western blot analysis, macro- or micro-array 

analysis, in situ hybridization, immunofluorescence, radioimmunoassay, and 

immunoprecipitation. The results obtained from these methods are compared and contrasted in 
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order to characterize further the functional relationships of the mRNA subsets and other mRNP 

* 

components. . 

RNA binding proteins and mRNP complex-associated proteins useful in the practice of the 
present invention are known in the art, or may alternatively be identified and discovered by the 
methods described herein. RNA binding proteins are involved in the control of a variety of 
cellular regulatory and developmental processes, such as RNA processing and 
compartmentalization, mRNA splicing and transport, RNA stabilization, mRNA translation, and 
viral gene expression. Examples of useful RNA binding proteins are shown in Table 1 . 

i 

■ i 

■ 
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Table 1 RNA Binding Proteins Associated with Glucose Metabolism 



RPP30 


NCOR2 


SNRPG 


LOC165271 


NCOA5 


IMP-1 


NFKB2 


OAS1 


DDX28 


LOC56902 


HNRD 


RNASE2 


RBMY1A1 


GRTH 


LOC163412P 


RNF17 


LOCI 36197 


LOCI 68400 


ENPP2 


HE3-BETA 


RRP46 


PRIM2A 


RPS15A 


LOC129715 


ZNF85 


EIF-2Bepsilon 


LOCI 26205 

• 


LOC127164 


LOCI 34477 


BOLL 


HYPA 


RPS21 


PCBD 


LOC151613 


LOCI 38280 


OAS3 

• 


YBX2 

• 


PSP1 


HQK 


PCBP3 


A2BP1 


FLJ12455 


DDX21 
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The techniques described herein are used to identify new (i.e., novel or previously 
unknown) RNA binding proteins and mRNP complex-associated proteins associated with 
glucose metabolism is referred to as RiboTrap™ (Figure 4). Thus, in one embodiment of the 

* * • 

invention, an mRNA of interest (depicted in Figure 4 as "RNA Y") is used as "bait" to trap a 
new RNA binding protein. Preferably, RNA Y is first converted to a cDNA using standard 
molecular biology techniques and is subsequently ligated at the 3 1 or 5' end to a DNA tag that 
encodes a sequence that will bind a ligand of the present invention (the ligand being illustrated as 
protein "X" in Figure 3). In other words, the tagged DNA encodes a binding partner of the 
ligand. The resulting fusion RNA is expressed in cells, where endogenous RNA binding proteins 
can bind and interact with RNA Y. The cells are then lysed and cell-free extracts are prepared 
and contacted with Protein X, which has been immobilized on a solid support matrix. After 
incubation, Protein X and the attached RNA fusion molecule and its associated RNA binding 
proteins are washed to remove residual cellular material. After washing, the newly isolated RNA 
binding proteins are removed from the RNA-protein complex and identified by protein 
microsequencing. Useful ligands include mRNP complex-specific antibodies or proteins (e.g., 
obtained from a subject with an autoimmune disorder or cancer) or proteins. Useful binding 
partners include antibodies specific for the ligand. 

Once partial protein sequence is obtained, the corresponding RNA binding protein gene 
may be identified from known databases of cDNA and genomic sequences or isolated from a 
cDNA or genomic library and sequenced. Preferably, the gene is isolated, the protein is 
expressed, and an antibody is generated against the recombinant RNA binding protein using 
known techniques. The antibodies are then used to recover and confirm the identity of the . 

* 

endogenous RNA binding protein. Subsequently, the antibody can be used for ribonomic 
analysis to determine the subset of cellular RNAs that cluster with associate with) RNA Y. 
The RNA binding protein is further tested for its ability to regulate the translation of the protein 
encoded by RNA Y, and is tested for validation as a drug target Likewise, proteins encoded by 
the cellular RNAs that cluster with RNA Y are tested for validation as drug targets, as further 
described herein. 
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Identification of Therapeutic Targ ets 

The invention provides methods for identifying a therapeutic target by comparing the 
ribonomic subprofiles of a cell sample to the ribonomic subprofiles of a control sample. A 
difference in the expression of a component of anmRNP complex between the two samples is 
indicative that the component is a candidate therapeutic target The therapeutic target may 
include, but is not limited to, any component of an mRNP complex, or nucleic acid or gene 
product thereof. In an embodiment of the invention, the cell sample is treated with a test 
compound and the control sample comprises cells that have not been treated with the test 
compound. In another embodiment, the control sample comprises cells at a different stage in 
their growth cycle from the cells in the cell sample. In yet another embodiment, the cell sample 
comprises a tumor cell or other diseased cell, and the control sample comprises a normal cell. 
Target identification includes methods known to practitioners in the art, such as, but not limited 
to, the use of screening libraries, peptide phage display, cDNA microchip array screening, and 
combinatorial chemistry techniques known to practitioners in the art. A summary of the steps for 
target discovery is provided in Figure 5 . 

Identification of Therap eutic 

In another aspect, the invention provides methods for assessing the efficacy of a test 
compound as a therapeutic. A cell sample is contacted with a test compound and a ribonomic 
profile or subprofile of the cell sample comprising the expression of at least one gene product 
associated with at least one mRNP complex is prepared. The expression levels of the gene 
product in the cell sample are compared to the expression levels of the gene product in a control 
sample (e.g., a cell sample that is not contacted with a test compound). Identification of a 
difference in expression of the gene product between the treated and untreated cell samples is 
indicative that the test compound is a potential therapeutic. Test compounds may be, for 
example, nucleic acids, hormones, antibodies, antibody fragments, antigens, cytokines, growth 
factors, pharmacological agents (e.g., chemotherapeutics, carcinogenics, or other cells), chemical 
compositions, proteins, peptides, and/or small molecules. 
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In various embodiments of the invention, the therapeutic may stabilize or destabilize the 
mRNA or the mRNP complex-associated protein. In another embodiment, the therapeutic may 
either inhibit translation of the mRNA, inhibit transport of the mRNA or the mRNP complex- 
associated protein, inhibit the binding of the RNA binding protein to the mRNA, inhibit the 
binding of the RNA binding protein to the mRNP complex-associated protein, or inhibit the 
binding of the mRNA to the mRNP complex-associated protein, for example. 

« 

In another aspect, the invention provides methods for assessing toxicity, potential side 
effects, specificity or selectivity of a test compound, for example, by altering the concentrations 
or amounts of a test compound used to treat a cell sample. 1 

» 

i 

In yet another aspect, the present invention provides methods for assessing or monitoring 
tiie efficacy of a therapeutic in a subject. In accordance with the invention, an effective amount 
of a therapeutic is administered to a subject At least one mRNP complex is isolated from a cell 
sample from the subject, wherein altered expression of a gene product associated with the mRNP 
complex is altered by administration of the therapeutic. The expression of the gene product in 
the cell sample after administration of the therapeutic is compared to the expression of the gene 
product in a control sample (e.g., a second cell sample obtained either prior to administration of 
the therapeutic or from a normal subject). A difference in expression between the treated and the 
control cell samples is indicative of the efficacy of the therapeutic. The above tests can be 

■ 

repeated over a period of time to monitor the continued efficacy of the therapeutic. 

Therapeutics may target over- or under-expressed proteins involved in the etiology of a 
disease, disorder, or condition. Such over- or under-expression may result in destabilization or 
stabilization of RNA. 

Therapeutics that Destabilize mRNA 

If a disease, condition or disorder is characterized by overexpression of a protein, a 
therapeutic for treatment of such a condition will reduce or eliminate expression of the protein. 
For example, since RNA binding proteins enhance the stability of short-lived mRNAs encoding 
protooncogenes, growth factors and cytokines that contribute to cell proliferation, inhibition of 
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RNA binding protein production may alleviate diseases such as cancers or autoimmune diseases 
(e.g., by decreasing tumor growth or inflammation, respectively). In addition, RNA binding 
protein overexpression in several human tumors correlates with resistance to chemotherapy and 
XJV irradiation. Increased stability of c-fos, c-myc, cyclin B 1 and other short-lived mRNAs in 
response to UV-irradiation or therapeutic drugs is well known. Accordingly, inhibition of RNA 

* » 

binding protein expression in these tumors destabilizes the mRNA in the tumors and, as a result, 
renders the tumors more responsive to cancer treatments. 

In order to reduce overexpression or to cease expression of a protein of interest, the 
mRNA can be destabilized by administering an effective amount of a suitable test compound 
(e.g., an RNA binding protein inhibitor) either in vitro or in vivo. The test compound may bind 
mRNA so as to inhibit RNA binding protein binding to the mRNA, bind the RNA binding 
protein so as to inhibit RNA binding protein binding^ the mRNA, bind to and destabilize the 
mRNP complex, and/or bind the mRNA so as to directly destabilize the mRNA, for example. 
Compounds that bind to the mRNA but that do not stabilize the mRNA may inhibit the ability of 
an RNA binding protein to stabilize the mRNA. If the compound binds competitively with an 
RNA binding protein, the compound can decrease mRNA stability by inhibiting the RNA 
binding protein's ability to bind the mRNA. 

Alternatively, the test compound may inhibit RNA binding protein or mRNA expression. 

Effective test compounds (e.g., RNA binding protein inhibitors) can be readily 
determined by screening compounds for their ability to interfere with the production of RNA 
binding protein or their ability to inhibit the binding to, and/or stabilization of, mRNA, for 
example, by methods described herein. Compounds that function by inhibiting RNA binding 
protein or mRNA production can be identified by exposing cells that express the RNA binding 
protein or mRNA of interest and monitoring the levels of RNA binding protein or mRNA, 
respectively. Compounds that function by inhibiting the stabilizing effect of RNA binding 
protein on mRNA can be identified by combining RNA binding protein and an mRNA that 
would otherwise be stabilized, adding compounds to be evaluated as RNA binding protein 
inhibitors, and monitoring the binding affinity of RNA binding protein and the mRNA. 
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Compounds that increase or decrease the binding affinity of RNA binding protein and the mRNA 
can be readily determined by art known methods. 

Therapeutics that Stabilize mRNA 

If a disease, condition or disorder is characterized by underexpression of an mRNA 
stabilizing protein, a therapeutic for treatment of such a medical condition may operate by 
stabilizing the mRNA associated with the underexpressed protein. Accordingly, mRNA may be 
stabilized by administering an effective amount of a compound, either in vitro or in vivo. The 
compound may possess a similar binding ability and stabilizing effect as the RNA binding 
protein or, may promote the RNA binding protein's ability to stabilize mRNA, and/or may 
promote the production of the stabilizing RNA binding protein or the mRNA of interest. Such a 
compound may be referred to as an RNA binding protein inducer and may operate by interacting 
with the mRNA, the RNA binding protein or both. Alternatively, mRNA can be stabilized by 
administering an effective amount of a suitable RNA binding protein that possesses the 
necessary mRNA stabilizing effect 

Compounds that increase RNA binding protein production can be identified by initially 
exposing cells that express the RNA binding protein to potential inducers and, monitoring the 
levels of the RNA binding protein, in accordance with the methods described above. If the level 
of RNA binding protein expression increases, the compound is an RNA binding protein inducer. 

* 

Compounds that inhibit RNA binding protein binding to mRNA, but which bind and 
stabilize mRNA, can be identified by methods disclosed herein. A skilled practitioner may 
combine RNA binding protein and an mRNA that would otherwise be stabilized, add a 

* 

compound, and monitor the binding affinity of the RNA binding protein and the mRNA. 

* 

Compounds that increase or decrease the binding affinity of an RNA binding protein and the 
mRNA can be readily determined by evaluating the binding affinity of the RNA binding protein 
to the mRNA after exposure to the compound, as described herein. By monitoring the 
concentration of mRNA over time, those compounds that bind to the mRNA can then be assayed 
for their ability to stabilize mRNA. 

* * 
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Antibody Preparation 



Antibodies and fragments thereof that bind to mRNP complexes are generated using 
methods that are well known in the art. Such antibodies may include, but are not limited to, 
polyclonal, monoclonal, chimeric, single chain, Fab fragments, and fragments produced by a Fab 
expression library. Antibodies and fragments thereof may also be generated using antibody 
phage expression display techniques, which are known in the art. 

For the production of antibodies, various hosts including, but not limited to, goats, 
rabbits, rats, mice, and humans are immunized by injection with the mRNP complex or any 
fragment or component thereof that has immunogenic properties. Depending on the host species, 
an adjuvant is used to increase the immunological response. Such adjuvants include, but are not 
limited to, Freund's, mineral gels such as aluminum hydroxide, and surface active substances 
such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet 
hemocyanin, and dinitrophenol. Among adjuvants used in humans, BCG (Bacilli Calmette- 
Guerin) and Corynebacterium parvum are preferable. 

Monoclonal antibodies to the components of the mRNP complex are prepared using any 
technique that provides for the production of antibody molecules by a cultured cell line. These 
include, but are not limited to, the hybridoma technique, the human B-cell hybridoma technique, 
and the EB V-hybridoma technique. Generally, an animal is immunized with the mRNP complex 
or immunogenic fragments) or conjugate^) thereof. Lymphoid cells (e.g. , splenic lymphocytes) 
are then obtained from the immunized animal and fused with immortalized cells (e.g., myeloma 
or heteromyeloma) to produce hybrid cells. The hybrid cells are screened to identify those that 
produce the desired antibody. 

Antibodies may also be produced by inducing in vivo production in the lymphocyte 
population or by screening immunoglobulin libraries or panels of highly specific binding 
reagents as is known in the art. 

Antibody fragments that contain specific binding sites for mRNP complexes may also be 
generated. For example, such fragments include, but are not limited to, the F(ab')2 fragments 
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which can be produced by pepsin digestion of the antibody molecule and the Fab fragments 
which can be generated by reducing the disulfide bridges of the F^b 1 ^ fragments. Alternatively, 
Fab expression libraries are constructed to allow rapid and easy identification of monoclonal Fab 
fragments with the desired specificity. 

Various immunoassays are used to identify antibodies having the desired specificity for 
the mRNP complex. Numerous protocols for competitive binding or immunoradiometric assays 
using either polyclonal or monoclonal antibodies with established specificities are well known in 
the art. Such immunoassays typically involve the measurement of complex formation between 
the component of the mRNP complex and its specific antibody. An immunoassay utilizing 
monoclonal antibodies reactive to two non-interfering epitopes is preferred, but a competitive 
binding assay may also be employed. 

The invention provides kits containing columns in which antibodies to various mRNP 
complexes or components thereof (e.g., antibodies to RNA binding proteins) are immobilized. 
The antibodies may be conjugated to a support suitable for a diagnostic assay {e.g., a solid 
support such as beads, plates, slides or wells formed from materials such as latex or polystyrene) 
in accordance with known techniques. Antibodies may likewise be conjugated to detectable 
groups such as radiolabels {e.g., 35 S, I25 I, ,31 I), enzyme labels (e.g., horseradish peroxidase, 
alkaline phosphatase), and fluorescent labels (e.g., fluorescein) in accordance with known 
techniques. Such devices preferably include at least one reagent specific for detecting the 
binding between an antibody and the RNA binding protein. The reagents may also include 
ancillary agents such as buffering agents and protein stabilizing agents (e.g., polysaccharides and 
the like). The device may .further include, where necessary, agents for reducing background 
interference in a test, control reagents, apparatus for conducting a test, and the like. The device 
may be packaged in any suitable manner, typically with all elements in a single container, along 
with a sheet of printed instructions for carrying out the test. 

Antibodies raised against an mRNP complex can be conjugated to a drug. Upon 
administration to a patient, the antibody will bind to the mRNP complex so as to deliver a 
relatively high concentration of the drug to the desired tissue or organ. In one embodiment, an 
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antibody is conjugated to an anti-cancer drug, including, but not limited to, an antifolate an anti- 
tumor antibiotic and other tumor-treating compound 

Antibodies that bind to the mRNP complex can also be covalently or ionically coupled to 
various markers, and used to detect the presence of tumors. By administering a suitable amount 
of the marker-coupled antibody to a patient, allowing the antibody to bind the mRNP complex at 
or around a tumor site, and detecting the marker, the presence of tumors can be detected. 
Suitable markers are well known in the art, and include, but are not limited to, radioisotopic 
labels, fluorescent labels and the like. Suitable detection methods for these markers are also well 
known in the art and include, but are not limited to, positron emission tomography, 
autoradiography, flow cytometry, radioreceptor binding assays, and immunohistochemistry. 

High Throughput Scree ning Methods for Lihrarift S of Comp ounds 

In an embodiment of the invention, high throughput screening assays and competitive . 
binding assays are used to identify compounds that bind to an mRNP complex or component 
thereof from combinatorial Ubraries of compounds {e.g., phage display peptide Ubraries, smaU 
molecule libraries and oligonucleotide libraries). 

In one embodiment, an mRNP component, catalytic or immunogenic fragment thereof, or 
oUgopeptide thereof, can be used to screen Ubraries of compounds in any of a variety of drug ' 
screening techniques. An exemplary technique is described in published PCT application 
W084/03584, hereby incorporated by reference. The fragment employed in such screening can 
be free in solution, affixed to a support (e.g., solid support), borne on a cell surface, or located 
intraceUularly. 

The SELEX method, described in U.S. Patent No. 5,270,163 (Gold et al.\ hereby 
incorporated by reference, is used to screen oligonucleotide libraries for compounds that have 
suitable binding properties. In accordance with the SELEX method, a candidate mixture of 
single stranded nucleic acids with regions of randomized sequence can be contacted with the 
mRNP complex. Those nucleic acids having an increased affinity to the mRNP complex can be 
partitioned and amplified so as to yield a Ugand enriched mixture. 
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Phage display technology is used to screen peptide phage display libraries to identify 
peptides that bind to an mRNP complex or component thereof. Methods for preparing libraries 
containing diverse populations of various types of molecules such as peptides, polypeptides, 
proteins, and fragments thereof are known in the art and are commercially available. 

> . 

A library of phage displaying potential binding peptides is incubated with an mRNP 
complex to select clones encoding recombinant peptides that specifically bind the mRNP 
complex or components thereof. After at least one round of biopanning (binding to the mRNP 
complex), the phage DNA is amplified and sequenced, thereby providing the sequence for the 
displayed binding peptides. Briefly, the target, an mRNP complex, can be coated overnight onto 
tissue culture plates and incubated in a humidified container. In a first round of panning, 
approximately 2 x 10 u phage can be incubated on the protein-coated plate for 60 minutes at 
room temperature while rocking gently. The plates are then washed using standard wash 
solutions. The binding phage can then be collected and amplified following elution using the 
target protein. Secondary and tertiary pannmgs can be performed as necessary. Following the 
last screening, individual colonies of phage-infected bacteria can be picked at random, the phage 
DNA isolated and subjected to automated dideoxy sequencing. The sequence of the displayed 
peptides can be deduced from the DNA sequence. 

The biological activity of compounds can be evaluated using in vitro assays known to 
those skilled in the art {e.g., protein synthesis assays or tumor cell proliferation assays). 
Alternatively, the biological activity of the compounds is evaluated in vivo. Various compounds, 
including antibodies, can bind to mRNP complexes and components thereof with varying effects' 
on mRNA stability. The activity of the compounds once bound can be readily determined using 
the assays described herein. 

Binding assays include cell-free assays in which an RNA binding protein and an mRNA 
are incubated with a labeled test compound. Following incubation, the mRNA, free or bound to 
a test compound, can be separated from unbound test compound using any of a variety of 
techniques known in the art. The amount of test compound bound to an mRNP complex or 
component thereof is then detenrnned, using detection techniques known in the art. 
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Alternatively, the binding assay is a cell-free competition binding assay. In such assays, 
mRNA is incubated with labeled RNA binding protein. A test compound is added to the reaction 
and assayed for its ability to compete with the RNA binding protein for binding to the mRNA. 
Free labeled RNA binding protein can be separated from bound RNA binding protein. By 
subsequently determining the amount of bound RNA binding protein, the ability of the test 
compound to compete for mRNA binding can be assessed. This assay can be formatted to 
facilitate screening of large numbers of test compounds by linking the RNA binding protein 
the mRNA to a support so that it can be readily washed free of unbound reactants. A p!asti< 
support (e.g., a plastic plate such as a 96 well dish) is preferred. The RNA binding protein and 
mRNA suitable for use in the cell-free assays described herein can be isolated from natural 
sources (e.g., membrane preparations) or prepared recombinantly or chemically. The RNA 
binding protein can be prepared as a fusion protein using, for example, known recombinant 
techniques. Preferred fusion proteins include, but are not limited to, a glutatmone-S-transferase 
(GST) moiety, a green fluorescent protein (GFP) moiety that is useful for cellular localization 
studies or a His tag that is useful for affinity purification. 

A competitive binding assay may also be cell-based. Accordingly, a compound, 
preferably labeled, known to bind an mRNP complex or component thereof, is incubated with 
the mRNP complex or component thereof in the presence and absence of a test compound. By 
comparing the amount of known test compound associated with cells incubated in the presence 
of the test compound with that of cells incubated in the absence of the test compound, the affinity 
of the test compound for the RNA binding protein, mRNA and/or complex thereof can be 
determined. Cell proliferation can be monitored by measuring the uptake into cellular nucleic 
acids of labeled bases (e.g., radioactively, such as 3 H, SiC, or 14 C; fluorescently, such as 
CYQUANT (Molecular Probes); or colorimetrically such as rdU (Boehringer Mannheim) or 
MTS (Promega)) as known in the art. Cytosolic/cytoplasmic pH determinations can be made 
with a digital imaging microscope using substrates such as bis(carboxyethyl)-carbonyl 
fluorescein (BCECF) (Molecular Probes, Inc., Eugene, Oregon). 
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Other types of assays that can be carried out to determine the effect of a test compound 
on RNA binding protein binding to mRNA include, but are not limited to, the Lewis Lung 
Carcinoma assay and extracellular migration assays such. as the Boyden Chamber assay. 

• # 

Accordingly, the methods permit the screening of compounds for their ability to 
modulate the effect of an RNA binding protein on the binding of and stability of mRNA. Using 
the assays described herein, compounds capable of binding to mRNA and modulating the effects 
on those cellular bioactivities resulting from mRNA stability and correlated protein synthesis are 
identified. The compounds identified in accordance with the above assays are formulated as 
therapeutic compositions. 

i 

Diagnosing and Monitoring Disease 

In another aspect, the invention provides methods for diagnosing a disease or risk of a 

i 

disease related to glucose metabolism (e.g., obesity or diabetes) or cellular function. A 
ribonomic profile from a subject's cell sample is prepared and at least one mRNP complex is 
analyzed. The expression of at least one gene product, for which altered expression is indicative 
of a disease or risk of disease, is determined. The gene product may be an RNA binding protein, 
an mRNA, an mRNP complex-associated protein or other gene product bound to or associated 
with the mRNP complex. The expression of the gene product in the cell sample is compared to 
the expression of the gene product in a control sample. The control sample may be either a 
sample of normal cells or a second cell sample from the subject Alternatively, the control 
sample is a positive control from a diseased and/or normal individual. By observing the relative 
expression of the gene product in the cell sample compared to the control sample, the presence of 
a disease or risk of disease can be determined. 

In another aspect, the invention discloses a method for monitoring a disease state in a 
subject. At least one mRNP complex is isolated from a diseased subject's cell sample, wherein 
the mRNP complex has at least one gene product that is associated with the disease. The 
expression of the gene product in the subject's cell sample is compared to the expression of the 
gene product in a control sample'. The identification of a difference in the expression of the gene 
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product in the diseased subject cell sample compared to the expression of the gene product in the 
control sample is indicative of a change in the disease state of the subject For example, a 

■ 

decrease in the production of a tumor related antigen or its mRNA is indicative of decreased 
tumor load or remission; by contrast, an increase in expression of the tumor antigen is indicative 
of aggressive tumor growth. Such monitoring during drug treatment provides information about 
the effectiveness of the subject's drug regimen, and may indicate when a particular regimen is 
not, or is no longer, effective for treating the disease or condition. The control sample may be a 
second cell sample from the subject, preferably, obtained when the subject is free of one or more 
symptoms of the disease. Alternatively, the control sample is from a normal subject or other 
normal cell sample. 

In summary, the present invention provides powerful in vivo methods for determining the 
ribonomic profile of a cell and detecting changes in the ribonomic profile. The invention has 

l 

numerous uses, including, but not limited to, monitoring cell development or growth, monitoring 
a cell state, and monitoring perturbations of a biological system such as disease, condition or 
disorder. The invention further provides methods for diagnosing a disease, condition, or disorder 
and determining appropriate treatment regimens. The invention also is useful for distinguishing 
ribonomic profiles among organisms such as plant, fungal, bacterial, viral, protozoan, or animal 
species. 

The present invention can be used to discriminate between transcriptional and post- 
transcriptional contributions to gene expression and to track the movement of RNAs through 
mRNP complexes, including the interactions of combinations of proteins with RNAs in mRNP 
complexes. Accordingly, the present invention can be used to study the regulation of RNA 
stability. The present invention can be used to investigate the activation of translation of 
mRNAs as single or multiple species by tracking the recruitment of mRNAs to active polysomes, 
measuring the sequential, ordered expression of mRNAs such as mRNAs that encode 
transcription factors or RNA binding proteins, and measuring the simultaneous, coordinate 
expression of multiple mRNAs. The present invention can also be used to determine the 
transacting functions of RNAs themselves upon contacting other cellular components. These 
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and numerous other uses will be made apparent to the skilled artisan upon study of the present 
specification and claims. 

The following Examples are set forth to illustrate the present invention, and are not to be 
construed as limiting thereof. 

Exemplification 
Example 1: Target Discovery Using Ribonomic Profiles 

The general steps required for target discovery using the methods of the invention are 

* 

summarized in Figure 1 and are exemplified below. 

Establishing Expression Profiles for RNA Binding Protein Genes 

RNA binding protein expression profiles are generated in normal and diseased human 
tissues or in tissues that are treated with an agent, such as glucose, an inhibitor, and/or insulin. 
Initial tissue, disease, or agent screening of RNA binding proteins is accomplished by 
Quantitative Reverse Transcriptase-PCR using oligo dT-primers and commercially available 
RNA samples (Stratagene, Inc., La Jolla, CA; Ambion, Inc., Austin, TX; BD Biosciences 
Clontech, Palo Alto, CA). 10-100fig of cDNA is used to perform Quantitative PCR using 
SybrGreen (Molecular Probes, Inc., Eugene, OR) and gene specific PCR primers on a BioRad 

» 

iCycler Quantitative PCR machine (Biorad) using protocols provided by the manufacturer. 
Experimental results are analyzed using the accompanying BioRad iCycler software. RNA 
levels for candidate RNA binding protein genes are normalized to rRNA. 

For more rapid and comprehensive screening of tissues and cell lines, a custom 
RIBOCHIP™ spotted microarray (Ribonomics, Inc., Durham, NC), designed and manufactured 
under contract by MWG Biotech USA, Highpoint, NC) may be used. The Ribochip™ contains 
known and putative human RNA binding protein genes, compiled from a wide variety of public 

m 

databases and search tools including GenBank (NCBI, Bethesda, MD), PubMed (NCBI, 

Bethesda, MD), SRS Evolution (LK)N Biosciences, Cambridge, MA), LocusLink (NCBI, 

Bethesda, MD), Protein FAMily database (pFAM); Welcome Institute Sanger Institute (Hinxton, 
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UK), GO Database (Gene Ontology™ Consortium) and Structural Classification of Proteins 
(SCOP©) Package (Medical Research Council, Cambridge, UK). This array contains 50-mer 
oligonucleotides on glass slides corresponding to greater than about 1,400 RNA binding protein 
genes in duplicate, non-contiguous positions (plus control genes). 

To screen for the expression of RNA binding proteins, RNA is prepared from cells in 
culture and from snap frozen clinical tissues according to the Qiagen RNeasy® protocol (Qiagen, 
Inc., Valencia, C A), quantified by absorbance at A260, and stored at -80°C until use. RNA 
samples are routinely evaluated using an Agilent Bioanalyzer2100, which provides a rapid and 
quantitative measure of 28S AND 1 8S ribosomal RNA integrity. Total or poly A+ RNA is 
labeled without amplification through generation of cDNA by reverse transcription in the 
presence of amino allyl-dUTP followed by direct coupling to Cy3 or Cy5 fluorescent dyes 
(TIGR SOP#M0004). The labeled RNA is used to probe the Ribochip™ to identify and/or 
quantify the level of RNA binding protein mRNA in the sample. Hybridization and washing are 
performed by standard procedures (TIGR SOP#M0005). If RNA quantities are limiting (< 10 
jag), nucleic acid amplification is accomplished with reverse transcription, cDNA tailing, and 
PCR amplification procedures prior to Cy3/Cy5 coupling. Data flow is then analyzed as 
summarized in Figure 3. Briefly, the Ribochip™ microarray slides are first scanned and read by 
a GENEPIX® Axon 4000B scanner using GENEPIX® 4.0 software (Axon Instruments, Inc., 
Union City, CA) for data acquisition. Spot features are then extracted with Biodiscovery's 
IMAGENE™ V.4.2 package (BioDiscovery, Inc., Marina Del Rey, CA). Data preprocessing, 
including intra- and inter-array data normalization, centralization, and scaling, is accomplished 
by visual (e.g., heat map) and quantitative (e.g., distribution analysis) methods implemented 
using the statistical environment R (Ross Ihaka and Robert Gentleman; R: A language for Data 
Analysis and Graphics, Journal of Computational and Graphical Statistics, 1996, 5, 299-314; 
hereby incorporated by reference) and BioConductor Suite of microarray data normalization and 
analysis libraries (BioConductor, Biostatistics Unit of Dana Farber Cancer Institute, Boston, 
MA). Final data analysis with normalization and scaling is then accomplished using gene 
clustering, statistical filtering and class prediction functions within the GENESPRING® 4.2.1 
software platform (Silicon Genetics, Redwood City, CA). Differentially expressed genes are 
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identified using a combination of methods for assessing variation in expression level between 
conditions. These Include t-tests and ANOVA analysis with multiple testing corrections, fold 
difference filtering, and clustering techniques. These methods enable genes with unique or 
"interesting" expression profiles to be quickly identified from among the thousand genes profiled 
in a single experiment Based upon array data, RNA binding proteins that are up or down 
regulated {e.g., differential RNA binding protein mRNA levels) to a statistically significant 
extent in a tissue-, disease-, or agent-specific manner are selected for confirmation studies by 
Quantitative PCR, Northern blot analysis, Western blot analysis, for example. 

To evaluate sample to sample variability, a series of self-self comparisons are performed. 
Using identical RNA samples, procedural variability in spot intensity is examined as a function 
of different slides, print tips, label, days, and investigators. Raw intensities for each of 1400 
spots on a custom array are compared following labeling with Cy3 or Cy5 fluorescent dyes. 
Based upon these analyses, the pooled coefficient of variation for between slide comparisons is 
determined (e.g., 1 1%). This represents an extremely high level of reproducibility for 
microarray analysis of gene expression. 

» 

* 

Cloning and Expression of RNA Binding Protein Genes in Bacterial Vectors 

* 

As soon as candidate, differentially expressed RNA binding proteins are identified, full 
length cDNA clones are generated by reverse transcriptase-PCR using commercial RNA tissue 
sources. Full-length plasmid clones are constructed based on phage lambda-based (att) site- 
specific recombination protocols (Invitrogen, Corp., Carlsbad, CA) for the GATEWAY™ 
pENTRD-Topo entry vectors and pDEST17 6XHis destination vectors (Invitrogen, Corp., 
Carslbad, CA). Escherichia coli (e.g., BL21SI or BL21A1) expressing polyhistidine-tagged 
RNA Binding Protein fusion proteins are grown to mid-log phase at 37 °C and induced in 0.3 M 
NaCl for BL21SI cells or in 0.2% mM arabinose or 0.1 mM IPTG for BL21 Al cells at 20-37 °C 
for about 2-6 hours (specific time based upon optimization in pilot expression studies for each 
clone). Bacterial cells are lysed by sonication and the fusion protein is purified on nickel 
columns (Qiagen, Inc., Valencia, CA) using standard methods. Insoluble fusion proteins are 
maintained and purified in the presence of 8M urea, and soluble proteins are maintained in PBS. 
Purified recombinant proteins are used for immunization of mammals (e.g., rabbits or chickens) 
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for production of polyclonal antibodies using standard methods or produced through a 
commercial contract Polyclonal antibodies are characterized by their ability to 
immunoprecipitate and detect by western blot, for example, native and recombinant proteins. 

Interrogation of mRNP complexes 

■ 

■ 

RNA binding proteins that are expressed in a tissue-, disease-, or agent-specific manner 
are markers for the cellular alterations that occur in response to the post-transcriptional 
processing of functionally related mRNAs. Changes in the abundance or constellation of RNA 
binding proteins in a cell affects the processing of any mRNAs bound to those RNA binding 
proteins. The subset of mRNAs that are associated with an RNA binding protein is indicative of 
functional co-regulation that is critically or causally involved in effecting a phenotype change in 
the cell. Thus, as a subset, those genes whose mRNAs are associated with tissue-, disease-, or 
agent-specifically altered mRNP complexes are a rich source of therapeutic targets for drug 
discovery. 

■ 

RNA binding proteins that exhibit the most dramatic variation with regard to expression 
proceed into the second stage of analysis, the Ribonomic Analysis System (RAS™) assay 
(Ribonomics, Durham, NC) a flow chart for which is illustrated in Figures 1 A and IB. The 
RAS™ assay provides affinity isolation and characterization of in vivo formed the mRNA 
complexes containing mRNA, RNA binding proteins, and/or mRNP complex-associated 
proteins. Monoclonal or polyclonal antibodies raised to a specific RNA binding protein, mRNP 
complex-associated protein, or a tag on an RNA binding protein or mRNP complex-associated 
protein of interest are used to co-immunoprecipitate and isolate the RNA binding protein and the 
associated subset of mRNAs and mRNP complex associated proteins according to standard 
methods or the manufacturers instructions. The associated RNAs are extracted and analyzed in a 
standard microarray format to identify and/or quantify the specific mRNAs associated with the 
RNA binding proteins. 

For the analysis of mRNAs associated with mRNP complexes, commercially available 
glass slide arrays (such as, e.g., Agilent Human Unigene 14K (Agilent, Palo Alto, CA) and 
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MWG Pan Human 10K (MWG Biotech, Inc., High Point, NC), or membrane arrays, such as, for 
example, Atlas™ Arrays (BD Biosciences, Clontech, Palo Alto, CA)), may be employed using 
protocols for hybridization, washing, and development provided by the array manufecturers. 

■ 

■ 

The composition ofRAS™ assay lysis buffer (RLB) may vary, depending on the binding 
characteristics of a particular RNA binding protein. Basic RLB contains 50 mM HEPES, pH 7- 
7.4, 1% NP-40, 150 mM NaCl, 1 mM DTT, 100 U/ml RNase OUT, 0.2 mM PMSF, 1 ng/ml 
aprotinin and 1 ug/ml leupeptin. Variations of these basic components included changes in salt 
concentrations (e.g., about 0-500 mM NaCl or about 0-5 mM KC1), ionic conditions (about 0-10 
mM MgCl 2 or about 0-20 mM EDTA), and reducing environment (about 0-5 mM DTT). For 
example, in order to prepare cell.extracts for exainimng the polypyrimidine tract binding protein 
(PTB) mRNP, cultured cells are washed in ice-cold PBS and scraped directly into RLB 
containing 5 mM MgCl 2 and incubated on ice for 10 minutes followed by centrifugation at 3,700 
xg for 1 0 minutes at 4 *C. 

It is necessary in certain cases to crosslink the RNA binding protein to target mRNAs 
prior to lysis and mRNP isolation. This is performed on cultured cells as well as fresh tissue 
samples. The extent of crosslinking is titrated for each cell line or tissue and monitored based on 
ability to hnmunoprecipitate mRNA in the complex. Cultured cells or tissue are incubated in 
phosphate buffered saline (PBS) containing about 0-1% formaldehyde at room temperature for 
about 15-60 minutes. Crosslinking is then quenched by the addition of 1M Tris to a final 
concentration of 250 mM and incubated further for an additional 20 minutes. The samples are 
then washed 3x in PBS containing 50 mM Tris. For cultured cells, the pellet is resuspended in 
radioimmunoprecipitation (RIPA) buffer (50 mM Hepes, pH 7.4, 150 mM NaCl, 1% NP- 
40, 0.1% SDS, 0.5% DOC and 100 U/ml RNase Out) at approximately 2 mg/ml final protein 
concentration. For tissue, the samples are resuspended in RIPA and homogenized with a 
Polytron to disrupt the tissue. Following the initial lysis, the samples (e.g., tissue or cultured 
cells) are subjected to sonication with a probe sonicator at output setting 6 (Branson 450, 
Branson Ultrasonics Corp., Danbury, CT) two times for 20 seconds each. Between sonications 
the samples are allowed to cool on ice for 2 minutes. Lysates are then cleared by centrifugation 
at 3,700 x g for 15 minutes. Subsequently, mRNP isolation as described above is performed. 
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Immimoprecipitation of mRNP and RNA Extraction 



On average, typical final protein concentrations for the cellular lysates are 2 mg/ml. 
Approximately 2mg protein is used for each immunoprecipitation condition. The cleared 
cellular extracts are incubated with primary antibody (e.g., anti-PTB is used at a final 
concentration of 10 jig/ml) or a control antibody at equal concentration (e.g., pre-immune or IgG 
sera at final concentration of 10 ug/ml) for 2 hours at 4 °C. A 25 ul aliquot of Protein A 

■ 

Trisacryl (Pierce Biotechnology, Rockford, IL) is added and the samples rotated for 1 hour at 
4°C. The immune complex is then washed 6x in RLB buffer by adding 1 ml/wash of RLB buffer 
followed by brief centrifiigations in a microcentrifuge for 30 seconds at 5,000 rpm. After the 
final wash, 50 ul of RNA extraction buffer from the PicoPure™ RNA isolation kit (Arcturus, 
Inc., Mountain View, CA) is added to the beads, vortexed briefly and centrifuged to pellet the 
beads. The extracted RNA is purified following the PicoPure™ protocol (Arcturus, Inc., 
Mountain View, CA). RNA present in the mRNP complex is then quantified using the 
RiboGreen™ assay (Molecular Probes, Inc., Eugene, OR). 

* 

Amplification of RNA for Microarray Analysis 

■ 

Since mRNA isolated from mRNP complexes represents only a small subset of total 
RNA, isolated mRNA may be amplified prior to labeling. Message Amp™ (Ambion, Inc., 
Austin, TX) is used for RNA amplification according to the manufacturer's instructions. Two 
rounds of amplification are performed prior to labeling by random primer polymerization with 
Cy3 or Cy5-dUTP. Hybridization and washing are performed according to the microarray 
manufacturer's protocols and as described above. Microarray data acquisition and analysis are 
performed as described above. 

Analysis of mRNP Cluster Microarray Results 

In a standard RAS™ analysis (e.g., normal vs. disease cells, treated vs. untreated cells), 
quantitative and qualitative changes in the total RNA content are compared to changes in the 
mRNP complex. The data obtained is fractionated into four classes: (1) mRNAs that show 
comparable quantitative changes in the mRNP complex, (2) RNAs present in the total RNA but 
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not in the mRNP complex, (3) RNAs present in the mRNP complex but apparently absent or 
below the level of detection in total RNA, and (4) RNAs that change in the cluster in a 
quantitatively different manner than in the total RNA analysis. In addition, the RAS ™ assay 
identifies genes represented by class 4 that do not change in total abundance but that are 
repartitioned within the cell for alternative processing and regulation. As a result, different 
splice variants may be translated, the mRNA might be transported to and translated at a specific 
location within the cell, or translation itself might be up or down modulated. The subsets of 
genes identified within groups 3 and 4 cannot readily be identified by any other currently 
available approach to characterization of gene expression. Analysis of mRNP complexes reveals 
mRNAs that are enriched in the complex but otherwise present at sufficiently low levels to be 
lost to background in the total RNA. 

« 

The methods of the invention identify genes that participate in the cellular pathways that 
contribute to the phenotypic changes associated with disease or certain cellular states and thus 
are attractive targets for drug discovery. In addition, the methods of the invention identify target 
classes that have proven to be tractable targets for small molecule drugs. These target classes 
include nuclear receptors (e.g., hormone receptors), G-protein coupled receptors, 
phosphodiesterases, kinases, proteases, and ion channels, among others. Other target classes of 
therapeutic interest include secreted molecules, extracellular ligands, and phosphatases. Among 
these gene classes, particularly attractive targets are those with the most restricted systemic 
expression profile. 

The RNA binding proteins identified as having altered expression in response to 
treatments, disease, or cell cycle changes are useful for targeting upstream signaling molecules 
for regulation (e.g., kinases). In addition, RNA binding proteins may be candidates for gene 
therapy (i.e., gene replacement) or therapeutic antibody targets. 

Functional Verification of mRNP Complexes 

For candidate target genes or gene products identified by the RAS™ assay, expression at 
the RNA and protein levels is confirmed by quantitative PCR and Western blot. Furthermore, 

■ 
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the function of an mRNP complex as it relates to the fate of the associated mRNAs, such as 
stability, degradation, or subcellular localization, is explored through a variety of techniques 
including, but not limited to, confocal microscopy, in situ hybridization, 3-hybrid reporter 
analysis to confirm ternary interaction between mRNA and the RNA binding protein or mRNP 
complex-associated protein, and in vitro methods assessing biochemical activities (e.g., 
overexpression or knockout studies). Such studies are supported by the in vitro demonstration of 
RNA binding protein binding to specific nucleotide sequences typically found in the 5' or 3' 
untranslated regions of the mRNA. 

■ 

Example 2: Identification and Immunoprecipitation of Preproinsulin RNA Binding 

Proteins Using RiboTrap™ 

» 

An alternative method for purifying and identifying RNA binding proteins is the 
RiboTrap™ assay (Ribonomics, Durham, NC). Two approaches for RiboTrap™ are described 
below. The first approach is an in vitro affinity-based assay using immobilized biotinylated 
oligonucleotides with sequences corresponding to RNA binding protein binding elements (Figure 
4). The second approach uses an affinity-tag placed on a full-length mRNA of interest or 
fragment of the mRNA of interest, which is expressed in a cell culture model and isolated using 
immobilized antibodies against the tag (Figure 6). 

To summarize, a cDNA representing a gene of interest or a 5' and/or 3 ' UTR, or other 
region of the cDNA that encodes an RNA binding protein binding site, is constructed using 
standard recombinant DNA or PCR techniques. For example, the cDNA is cloned into an 
expression vector possessing an appropriate mammalian cell promoter such as a CMV, SV40 
actin promoters, or alternatively an adenovirus or retrovirus vector, and transfected into 
compatible mammalian cell line. For example, for the isolation of RNA binding proteins that 
participate in glucose metabolism, the cDNA may be expressed in a preadipocyte, adipocyte, < 
pancreatic beta cell line, for example. Following expression of the engineered cDNA, a cell 
extract is prepared that maintains the association between mRNAs and their associated RNA 
binding proteins and mRNP complex-associated proteins as described above. The mRNA 
encoded by the transfected cDNA is affinity purified using an affinity protein that is known to 

44 



or 
a 



bind to the mRNA of interest, preferably one that does not interfere with the binding of the 
mRNA to its RNA binding protein(s) (Method 1 below; Figure 4). The affinity protein used may 
be linked to a solid matrix, such as agarose or Sepharose beads and may be biotinylated or 

* 

otherwise labeled. The affinity protein may also be bound to the solid matrix indirectly via 
binding to an antibody that is bound to the solid matrix (Method 2 below; Figure 6). The affinity 
protein-matrix is used to isolate the expressed RNA, along with the RNA binding proteins and/or ' 
mRNP complex-associated proteins that are associated with the mRNA in vivo. Variations on 
the method include chemical crosslinking of the RNA binding proteins and mRNAs with 
formaldehyde or the use of an epitope tagged or beaded binding element or an epitope tagged 
mRNA of interest. 

Proteins that are isolated in association with the mRNA of interest using the RiboTrap™ 
assay can be identified using standard proteomic methods. For example, Matrix Assisted Laser 
Desoiption/Ionization - Time-of-Flight Mass Spectrometry (MALDI TOF) and Tandem Mass 
Spectrometry (or Mass Spectrometry/Mass Spectrometry (MS/MS)) can be used to identify 
peptide sequences that can be subjected to database searches. Antibodies reactive with identified 
RNA binding proteins or mRNP complex-associated proteins can be raised in mammals 
according to standard methods and used to perform the RAS™ assay, as described previously. 

Methods and Materials 

■ 

Preparation of Cell Free Extracts 

Cells were removed from tissue culture plates with a rubber scraper and washed with cold 
phosphate buffered saline (PBS). The cells were resuspended in approximately two pellet 
volumes of polysome lysis buffer (PLB) containing 100 mM KC1, 5 mM MgCl 2 , 10 mM HEPES 
pH 7.0, and 0.5 % NP-40 with 1 mM Dithiothrietol (DTT), 100 U/mL RNase OUT (Gibco BRL, 
Invitrogen Corp., Carlsbad, CA), 0.2 % vanadyl ribonucleoside complex (VRC) (Gibco BRL, 
Invitrogen Corp., Carlsbad, CA), 0.2 mM Phenylmethylsulfonylfluoride (PMSF), 1 mg/mL 
pepstatin A, 5 mg/mL pepstatin, and 20 mg/mL leupeptin added fresh at the time of use. The 
cell lysate was frozen and stored at -100 °C. At the time of use, the cell lysate was thawed and 
centrifuged at 12,000 rpm in a tabletop microfuge for 10 minutes at 4°C. The supernatant was 
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removed and centrifiiged a second time at 1 6,000 rpm in a tabletop microftige for 5 minutes at 
4°C before being stored on ice or refrozen at -100°C. The mRNP containing cell lysate 
contained approximately 30-50 mg/mL total protein. 

Immunoprecipitation 

Protein A sepharose beads (Sigma Biochemicals, St. Louis, MO) were swollen 1 :5 v/v in 
NT2 buffer (50 mM T ris pH 7.4, 150 mM NaCl, 1 mM MgCl 2 , and 0.05 % NP-40) and 
supplemented with 5 % BSA A 300 jiL aliquot of the 1 :5 v/v pre-swollen: Protein A beads were 
incubated overnight at 4 °C with excess immunoprecipitation antibody (typically 5-20 \xL 9 
depending on the antibody). The antibody-coated Protein A beads were washed 5 times with 
ice-cold NT2 buffer and resuspended in 900 |iL of NT2 buffer supplemented with 100 U/mL 
RNase OUT, 0.2 % Vanady/Ribonucleoside Complexes, 1 mM DTT, and 20 mM ethylene 
diaminetetracetic acid (EDTA). The beads were briefly vortexed and 100 jiL of the mRNP 
lysate was added. The beads were immediately centrifiiged and a 100 aliquot was removed 
to represent total cellular RNA (essentially one-tenth the quantity of lysate used in the mRNP 
immunoprecipitations). The immunoprecipitation reaction and an aliquot removed to represent 
total cellular RNA were tumbled at room temperature for a time period of from zero minutes to 
about two hours. Following an appropriate incubation, the Protein A beads were washed four 
times with ice-cold NT2 buffer followed by two washes with NT2 buffer supplemented with 1 M 
urea. The washed beads were resuspended in 100 jiL NT2 buffer supplemented with 0.1 % 
sodium dodecyl sulphate (SDS) and 30 jig proteinase K and incubated for 30 minutes in a 55°C 
water bath. Following proteinase K digestion, immunoprecipitated RNA was isolated with two 
phenol/chloroform/isoamyl alcohol extractions and ethanol precipitated. 

Method 1 : In Vitro Affinity-Based Assay Using Immobilized Biotinvlated Oligonucleotides 

An in vitro affinity-based assay using immobilized biotinylated oligonucleotides is shown 
in Figure 4. Probes for affinity-purification of preproinsulin RNA binding proteins were 
synthesized and biotinylated with biotin-modified T (thymidine) by art known methods (e.g., 
Ross et al. (1997) MoL Cell. Biol. 17:2158-65). The probes for purification of preproinsulin 
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RNA binding proteins were the following: a) for 3'-UTR element one 5'- 
gaauaaaaccuuugaaagagcacuac-3% b) for 3'-UTR element two 5'- 
cccaccacuacccuguccaccccucugcaaug-3', and c) for 5'-UTR element two 5>- 
agccctaagtgaccagctacagtcggaaaccatcagcaagcaggtcattgttccaac-3 \ , In addition 



limine 



specie RNA binding p roteins . n. bioHnylated ^ ^ 



beads 



Success, NY) overnight in a 1M Nad-ccntaining buffer as described (Ross e, ai., 1997) Beads 
were washeu in high «ft buffer «„ remove unbound probe, and then combated in binding 
uff«. Concentrated ceU exunc* (50-250 mg protein) were plated as described (Rosa e, a,., 
1997) ftom mh ea ted m glucose-^ INS-1 orNIH3T3 cella. Exuncts vwe incubated with 
umnomhaed biotinyhneo probes (.-5 tug of coupfcd probe) for 4-12 boura a, 4 »C. washed, and 
protons ended u, SDS-PAOE sample buffer. After separation by SDS-PAOE bands 
corresponding to proteina specific** bound to probes are excised and subject* to MALM and 
protein sequencing. 

To specifically confirm binding of polypynmidine tract binding protein (PTE) to the 
_sulin 3' UTR, eluted PTE was analyzed by Western blot using commercially available 
PTE anubody (F lg nre 7). Both recombinant PTE and native PTE derived from INS-1 cell 
lysates was evaluated for binding. 

Figure 8 illusuutea the curen, paradigm of glupose-regnlated RNA binding protein 

binding of RBP1 (alsoknownasPTO) tothe V irre „f ■ ,. 

" Mr " 5 )«>«x»3 UTRoftheprep,oinsulminRNA,aswellas 

putat.vebmd.ng of other unidentified RBP1 RNA binding proteins. TheS'-UTRof 
prepnnnsufin mRNA contains a secondary (stem-.oop) structure (AO- .,0.8 kcahtaol) drat is 
stnular to shuchnes found in other mRNAs tha, undergo regulation of biosymheais a, the 
hanstanona. (eve,. Fu.them.ore, the stem- lo op stracmre ia conserved in mammalian 

boI^Cr ^ 5, - UTO — ^-se-reaponse element When 
bod, 5 - and 3 -UTOs are present, there is an even greato- response to gmcose. In addition, the 
gteose-stunmated trunsMon is panose be* eell-specific, since no ghtcose response is 
obsorveu m non-bcm cells. Ms shongly suggests the involvement of giucose-regulated RNA 
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binding proteins working via the 5'-UTR. In its simplest form the available data suggest a model 
in which at low or resting glucose levels, an RNA binding protein(s) is bound to the 5*-UTR of 
the preproinsulin mRNA and represses its translation. Increased glucose concentrations cause a 
change in the abundance or in the affinity of the RNA binding protein(s) for the preproinsulin 5*- 

* 

UTR, thus relieving the repression and allowing enhanced translation of preproinsulin mRNA 
(Figure 8). 

Method 2: Direct Affinity-Tagging Of mRNA With An RNA-Eoitope 

An direct affinity-tagging of mRNA with an RNA-epitope assay is illustrated in 
Figure 6. This method is based on antibody-recognition of a unique RNA stem loop 
structure. The well-characterized antibody a-glO (a.k.a. a-T7-tag) is raised against the N- 
terminus of glO fusion protein by standard methods. This antibody is used to screen a 
complex library of degenerate RNAs (10 6 molecules) representing various stem loop 
structures. Following stringent washing conditions, a single 40 nucleotide RNA species is 
identified (D10) that was specifically recognized by a-glO. Upon further characterization, 
the Dl 0 RNA is shown to mimic the peptide antigen, thus one can use the peptide for 
competition or elution. When the RNA-epitope is inserted into an mRNA, the RNA epitope- 
tagged mRNA could be specifically recovered from a mixture of total cellular mRNAs using 
a-glO. Furthermore, the antibody alone has no reactivity with total eukaryotic cellular 
mRNAs. 

The D10 RNA-epitope tag is placed at the end of the 3'-UTR of the gene for Nkx6.1 
and preproinsulin. This is accomplished by PCR cloning the tag into the full-length cDNAs 
for Nkx6.1 or preproinsulin (obtained by PCR cloning). These constructs are used for 1) 
generating in vitro transcripts for competition and affinity reagents, and 2) overexpression of 
Nkx6. 1 or preproinsulin in a mammalian cell culture model followed by recovery of the 
RNA epitope-tagged mRNA from cell extracts with a-glO. 

For the preproinsulin studies, the D10 RNA epitope-tagged preproinsulin cDNA is 
subcloned into pcDNA3.1neo and used to transfect MIN-6, a-TC1.6 and NIH3T3 cells. 
Transiently transfected cells as well as establish stable transfectants (selected with Neo) are 
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examined. Expression of the tagged mRNAs is under the control of the CMV promoter. 
Once expression of the tagged mRNA is confirmed by RT-PCR, extracts were prepared as 
described above from cells incubated in low or high glucose. Mock transfected cells are also 
examined. 

« 

Construction and transection into the various cell-types of a DIO RNA epitope- 
tagged Nkx6.1 is performed in a similar manner. For analysis, the RNA epitope-tagged 
mRNAs are isolated from the extracts using immobilized a-glO. Proteins in these complexes 
are eluted with SDS-PAGE sample buffer or using antigenic peptide (NH 2 - 
MASMTGGQQMGRC-COOH), which is previously shown to compete for the DIO epitope. 
A comparison of protein profiles obtained from the various cell extracts (including mock 
transfected cells) identified unique protein bands. The eluted proteins are processed as 
described in Example 1 above to obtain peptide sequence. One variation on this procedure 
included DIO-tagging of a fragment of the full-length mRNA (e.g., the 5'- or 3MJTR alone 
containing the DIO epitope). 

A comparison of RNA binding protein expression profiles from a-TCl .6 cells, 
pancreatic beta cells (which express both homeodomain transcription factor Nkx6. 1 mRNA 
and protein), and NIH3T3 cells is performed to identify cell-type specific RNA binding 
proteins using RiboMap™. These RNA binding proteins represent candidate proteins that 
control Nkx6.1 expression. 

RAS is then performed using antibodies to these candidate RNA binding proteins 
and the resulting functional clusters analyzed for Nkx6.1 mRNA expression. Afunctional 
cluster containing Nkx6.1 mRNA contains other mRNAs that are coordinate^ regulated, and 
perhaps code for proteins involved in development of the endocrine pancreas and/or 
pancreatic beta cell differentiation. Proteins that bind to the 5'-UTR of Nkx6.1 mRNA can 
also be purified. 

Specificity and mapping of KNA bindinp protein binding *1am» n t« 
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In order to verity potential RNA binding proteins and their binding specificity, 
competition experiments using immobilized binding sites (either biotinylated probes or D10 
epitope-tagged probes generated by in vitro transcription) are performed. For example, the 
specific binding site is immobilized with either streptavidin agarose or a-glO agarose and 
incubated with cell extracts or a recombinant RNA binding protein according to art known 
methods. The binding reactions are carried out in the absence or presence of increasing 
concentrations of control or competing non-biotinylated or non-tagged probes (synthetic 
oligonucleotides or generated by in vitro transcription, as described above). Binding is 
analyzed by 1) electrophoretic mobility shift assays as described in the art and/or 2) SDS- 
PAGE followed by Coomassie staining, to detect the presence or absence of RNA binding 
protein bands. RAS™ may also be performed as a third verification procedure. In this c? 



case 



antibodies raised against the RNA binding protein are used to immunoprecipitate complexes 
as described above and microarray analysis is performed to identify the associated mRNAs, 
one of which should be the original endogenous target mRNA. 



Example 3: Analysis of RNA Binding Protein Expression and Associated mRNAs in 
Human Adipocytes and Preadipocytes 

Adipocytes have long been considered a primary location for glucose disposal and energy 
storage in the form of triglycerides (fat). There is growing evidence that adipocytes comprise a 
critical endocrine tissue that not only responds to insulin through glucose uptake and lipogenesis, 
but also synthesizes and secretes a variety of signaling molecules involved in systemic energy 
homeostasis. An analysis of RNA binding proteins and their associated mRNAs and mRNP 
complex-associated proteins and their role in gene expression in adipocytes provides a better 
understanding of adipocyte function and can provide targets for therapeutics that treat conditions 
associated with aberrant glucose metabolism. A flow chart for an exemplary adipocyte analysis 
is provided in Figure 9. 

RNA binding proteins that are enriched in mature adipocytes vs. preadipocytes in lean 
individuals (BMI < 24) were identified as follows. Briefly, human preadipocytes were harvested 
from elective liposuction from three lean individuals. A portion of the preadipocytes were 

50 



differentiated in culture to mature adipocytes (Zen Bio, RTP, NC). The expression pattern of 
RNA binding proteins in mature adipocytes was compared to the expression pattern of RNA 
binding proteins in preadipocytes using a microarray chip, Ribochip™ V.l array (MWG 
Biotech, Germany) according to the method described in Example 1. The Ribochip™ contains 
approximately 1400 distinct RNA binding proteins. 

♦ 

Figure 10 is a graphical representation of differentially expressed RNA binding proteins 
in preadipocytes compared to mature adipocytes. Eleven RNA binding proteins were identified 
that were enriched 2-fold or more in the mature adipocytes. Figure 1 1 A is a graphical 
representation of certain RNA binding proteins that are upregulated in mature adipocytes as 
compared to preadipocytes. Figure 1 IB is a graphical representation of certain RNA binding 
proteins that are downregulated in mature adipocytes as compared to preadipocytes. 

The RNA binding proteins identified as being up- or down- regulated two-fold or more 
were identified by the methods described in Example 1. Figure 12 provides a list of those RNA 
binding proteins. These data provide a refined list of candidate RNA binding proteins for further 
validation for participation in an adipocyte function pathway (e.g., maturation or differentiation) 
and/or glucose metabolism pathway and for the isolation of associated mRNP complex- 
associated proteins and associated mRNAs. 

The effects of insulin or the pancreatic beta 3 agonist BRL-37344 on RNA binding 
protein expression in human mature adipocytes was also examined. Mature adipocytes from lean 
individuals were obtained as described above and treated with 100 nm insulin or l(xM BRL- 
37344. Cell lysates were prepared and RNA binding proteins were identified using Ribochip™ 
analysis as described above. 

The RNA binding protein expression of insulin-treated adipocytes was compared to the 
RNA binding protein expression of untreated adipocytes. Figure 1 3 A is a graphical 
representation showing the up- and down-regulation of the RNA binding proteins that are 
expressed in mature adipocytes in response to insulin or the pancreatic beta 3 inhibitor. Figure 
13B shows another graphical representation of RNA binding proteins that are up- or down- 
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regulated 1 .5-fold or more after treatment of mature adipocytes from lean individuals with 
insulin. Approximately 50 RNA binding protein genes were responsive to insulin (i.e., they were 
either up or down regulated at least 1 .5-fold). RNA binding proteins were identified using 
Ribochip™ analysis as described above. Figure 14 provides a list of RNA binding proteins that 
are increased 2-fold or more in mature adipocytes from lean individuals treated with insulin. 
These data provide a refined list of candidate RNA binding proteins for further validation for 
participation in an adipocyte function pathway (e.g., maturation or differentiation) and/or a 
glucose metabolism pathway and for the isolation of associated mRNP complex-associated 
proteins and associated mRNAs. 

In addition, the expression pattern of RNA binding proteins in mature adipocytes from 
three lean individuals was compared to the expression pattern of RNA binding proteins in mature 
adipocytes from three obese individuals (BMI < 30). Preadipocytes were obtained by elective 
liposuction and cultured as described above. Adipocytes from obese individuals showed an 
altered pattern of RNA binding protein expression. For example, only 4 of the 1 1 RNA binding 
proteins enriched in the adipocytes from lean individuals were present in the adipocytes from the 
obese individuals. Thus, 7 out of the 1 1 RNA binding protein genes are aberrantly regulated in 
obese individuals and are not induced by insulin. 

In another experiment, the RNA binding protein expression in preadipocytes from obese 
individuals was compared to expression in mature adipocytes in obese individuals. 
Preadipocytes and adipocytes were obtained from obese individuals as described above. RNA 
binding proteins were identified using Ribochip™ analysis as described in Example 1. Figure 15 
illustrates a list of 14 RNA binding proteins that were induced 2 fold or more in mature 
adipocytes from obese individuals as compared to preadipocytes from obese individuals. These 
data provide a refined list of candidate RNA binding proteins for further validation for 
participation in an adipocyte function pathway (i.e., maturation or differentiation) and/or glucose 
metabolism pathway and for the isolation of associated mRNP complex-associated proteins and 
associated mRNAs. 

* 
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Example 4: Analysis of RNA Binding Protein. Expression in Rat Pancreatic Beta Cells 
Treated with Insulin 

* 

The effect of glucose on RNA binding protein expression in rat pancreatic beta cells was 
examined! A derivative of the INS-1 rat pancreatic beta cell line, clone 832/13, was chosen 
because of its ability to mimic many of the normal functions of pancreatic islets. Whereas INS-1 
cells respond to glucose treatment with a 2-4 fold increase in insulin secretion, clone 832/13 is 
induced 8-13 fold by glucose treatment. The 832/13 cell line was treated with glucose and RNA 
binding proteins that interact with the 5' and 3' UTRs of the proinsulin mRNA were isolated. In 
particular, RNA binding proteins that bind with high affinity and specificity to the UTR elements 
and whose binding or abundance is also glucose inducible are most likely to be the mediators of 
post-transcriptional regulation of proinsulin. Briefly, 832/13 cells were grown to near 
confluence, shifted to low glucose (3mM) for 1 hour 1 8 hours prior to assay (T = -18 hours), and 
treated for 2 hours (T = 0 to T= 2 hours) with fresh medium containing 3mM or 15 mM glucose. 
Insulin secretion was measured by ELISA (ICN Pharmaceuticals) for 24, 48, 72 hours and 
expressed as ng/mL. Fold-induction of insulin levels was determined as a measure of the 
difference between insulin secreted in response to low glucose concentration versus insulin 
secreted in response to high glucose concentration. Cells jvere harvested and cytoplasmic 
extracts were prepared as described in Example 1 for immediate use or storage at -80° C. 
Lysates derived from untreated primary rat and human islets were also evaluated as controls. 

Preferably, the binding activity should be readily distinguishable as one, or at most, a few 
prominent RNA binding proteins, and not accompanied by a large number of mRNP complex- 
associated proteins. In addition, the amount of RNA binding protein is a key parameter for 
ensuring unambiguous sequence assignment for the bound RNA binding protein. Binding and 
washing conditions can be altered according to art known methods to optimize high affinity and 
sequence-specific interactions. For example, a stepwise increase in the concentration of 
monovalent or divalent cations (e.g., Na + , K + , Li + , Ca 2+ , Mg 2 *) can be used to select for high 
affinity interactions. Additionally, unrelated nucleic acid sequences can be used to either pre- 
clear cytoplasmic lysates or as a control for non-specific RNA binding. Secondly, in this 

particular system, RNA binding activity is thought to be glucose-inducible, which provides an 
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important selection tool for the assignment of regulatory RNA binding protein. Thus, the UTR 
binding activities should fluctuate with glucose induction. 

Alternatively, a yeast-based 3-hybrid screening (Bernstein et al. (2002) Methods 26:123- 
41) may be used to isolate RNA binding proteins. Briefly, activation of a reporter gene in yeast 
is dependent upon the interaction between a test RNA (e. gi , the proinsulin UTR) and candidates 
from a library of co-expressed RNA binding protein fusion proteins. From a commercial 
standpoint, the development of either or both techniques can be used to define specific regulatory 
RNA binding proteins associated with known or unknown mRNAs or mRNA fragments. 

Example 5: Identification Of mRNA Subsets Associated With PTB Complexes That Are 
Co-Regulated With Insulin mRNA Expression In Rat Glucose-Responsive Pancreatic Beta 
Cells (INS-1) Using cDNA Arrays (RAS™) 

» 

An in vitro cDNA array RAS™ assay was used to identify the subset of mRNAs 
associated with PTB and other RNA binding protein complexes, for example, in rat glucose- 
responsive islets (INS-1) cells. 

Cloning and Expression of RNA Bindin g Protein Genes that Regulate Insulin 

Regulatory RNA binding proteins that bind to the 3' UTR of the proinsulin gene were 
identified according to Example 2. RNA binding protein cDNAs were cloned and expressed in 
bacterial vectors as tagged or fusion proteins (e.g., 6X His or glutathione-S-transferase (GST)). 
For example, the RNA binding protein named polypyrintidine tract binding (PTB) protein binds 
to the 3 • UTR of proinsulin mRNA. The human PTB cDNA was cloned into a pGEX4T vector, 
which contains a GST affinity tag, and expressed in K coli cells in the presence and absence of 
glucose. The GST-PTB fusion protein was purified from bacterial lysates using the GST affinity 
tag, as described above 

Isolation of Co-Regulated mRNAs 



The GST-PTB fusion protein was used to screen for mRNAs that bind to PTB. Briefly, 
GST-PTB fusion protein was bound to a glutathione sepharose support through the GST 
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linkage according to standard methods. INS-1 cells were cultured as described in Example 2. 
Cytoplasmic lysates containing mRNA from glucose-treated INS-1 cells or from primary islets 
were exposed to the bead-bound GST-PTB fusion protein. Messenger RNAs wntainirig UTR 
elements that bind to PTB were retained on the beads. Ionic conditions for binding and washing 
were altered to select for high affinity binding of mRNAs to PTB or other RNA binding proteins, 
as described above. 

Bound mRNAs were selectively eluted with glutathione, which competes with GST to 
displace the mRNA-RNA binding protein complexes from the beads. Glutathione elution 
enables the selective elution of only those mRNAs that are bound to the RNA binding protein, 
and minimizes contamination with mRNAs that are non-specifically associated with the 
sepharose matrix. As a positive control, eluted mRNAs were enriched for the presence of 
proinsulin mRNA, which was directly assessed using quantitative PCR, according to standard 
methods. 



were i 



After assessing the overall gene expression profile of the RBP1 mRNA complexes, they 
imunoprecipitated and captured mRNAs were identified on cDNA arrays to quantitate the 
full spectrum of bound mRNAs. Briefly, cDNA array analysis was performed using Agilent 
Human Unigene 14K (Agilent, Palo Alto, CA) or MWG Pan Human 10K (MWG Biotech Inc 
High Point, NC), two-color fluorescent high density microarrays. Briefly, RNA was extracted x 
from LML-1 cell lysates and used to produce reverse transcribed probes according to standard 
methods. A pooled set of primers, complementary to the genes represented on the array, was 
used for the reverse transcription probe synthesis, which was radiolabeled with 32 Pcc-dATP. The 
radiolabeled probe was purified by passage over CHROMA SPIN™-200 columns (Clontech, 
Inc., Palo Alto, CA) and incubated overnight with an array membrane using EXPRESSHYB™ 
hybridization solution (Clontech, Inc., Palo Alto, CA) according to the manufacturer's protocols. 
Following hybridization, the array membrane was washed and visualized on a phosphorimaging 
screen (Molecular Dynamics, Sunnyvale, CA). 

Phosphorimages were scanned using the Molecular Dynamics STORM 860 System at 
100 micron resolution and stored as files. Images were analyzed using ATLASIMAGE™ 1.0 
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and 1.01 software (Clontech, Inc., Palo Alto, CA). The signal for any given gene was calculated 
as the average of the signals at two duplicate cDNA spots. As described in the 
ATLASIMAGE™ 1.0 software manual (Clontech, Inc., Palo Alto, CA), a default external 
background setting was used in conjunction with a background-based signal threshold to 
determine gene signal significance. The signal for a gene was considered significantly above 
background if its adjusted intensity (total signal minus background) was more than two-fold the 
background signal. Comparisons of multiple cDNA array images were performed using an 
average of all the gene signals on the array (global normalization) to normalize the signal 
intensity between arrays. The initial alignment of these arrays was facilitated by spiking the 
hybridization reaction with radiolabeled lambda phage markers that hybridized with six DNA 
spots on the bottom of the array membrane. Once the alignment register was established, 
subsequent blots did not require the use of spiked lambda markers for orientation. cDNA array 
images and overlays were prepared using ADOBE PHOTOSHOP® 5.0.2 (San Jose, California, 
USA). 

RBP1 complexes isolated from rat glucose-responsive islets (INS-I cells) contained the 
mRNA classes listed in Table 2. 

Table 2: Gene List for RBP1 Functional Cluster 



14 Kinases* 



1 1 Transporters 



8 Phosphatases 



18 Channels** 



4 Peptidases/Proteases 



33 Receptors 



3 Transcription Factors 



8 Transferases 



*8/14 genes have no previous role in G 



ucose Stimulate Insulin Secretion 



••14/18 genes have no previous role in Glucose Stimulate Insulin Secretion 

Figures 16A and 16B provide lists of the mRNAs that are associated with the PTB RNA 
binding protein. It is notable that many of the ion channel proteins identified on the PTB cluster 
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were not previously identified as participating in glucose metabolism (Figure 17). These 
proteins represent targets for new therapeutics that may be used to regulate a pathway that 
participates in glucose metabolism or other pancreatic beta cell function. Figure 2 illustrates 
some of the known pathways that participate in insulin secretion in pancreatic beta cells, 
indicating some of the proteins encoded by mRNAs listed in Figures 16A and 16B. 

A representative set of mRNAs enriched over five fold in the RNA binding protein- 
bound fraction was evaluated by gel retardation analysis to confirm that the bound mRNAs 
contain specific sequence elements that participate in RNA binding protein regulation. 

Confirmation That Candidate RN A Binding Proteins That Are Co-Regulated With P roinsulin 
mRNA In Vitro Bind to the Same RNP Complex In Vivo 

Sharing common RNA binding protein-binding elements with the proinsulin gene may 
represent a common requirement for the set of genes that are potentially post-transcriptionaUy 
co-regulated in concert with proinsulin. The identification en masse of binding sites for 
regulatory RNA binding proteins provides a provisional set of co-regulafed genes that can be 
compared to those identified from endogenous RNP complexes. To confirm their association 
with the proinsulin regulatory RNA binding proteins in vivo, tagged versions of PTB and the 
other putative 5' and 3' UTR-binding RNA binding proteins are expressed as tagged fusion 
proteins in INS-1 cells. Using flag- or Gl 0-epitope tags, the three putative RNA binding 
proteins (PTB and two unknown RNA binding proteins) are stably selected and expressed in 
INS-1 cells. The addition of the epitope tags streamlines the ability to immunoprecipitate the 
RNP complexes from the cells, since under most circumstances the epitope is not buried within 
the complex. Following stable selection of the tagged RNA binding proteins, RNP complexes 
containing the three putative proinsulin regulatory RNA binding proteins being analyzed by in 
vitro binding are immunoprecipitated. 

For comparison to the mRNA pools generated by in vitro binding, RNA was isolated 
from the immunoprecipitated in vivo RNP complexes and analyzed by a combination of 
microarray and quantitative PCR. 
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The combined in vitro and in vivo analysis of RNP complexes offers a powerful 
approach to the study of post-transcriptional regulation. The comparative analysis identifies 
the set of genes being coordinately regulated in conjunction with the proinsulin gene in 
response to glucose. These data provide a roadmap of the regulatory, metabolic, and 

* 

signalling pathways that act in concert to orchestrate the proper production of insulin, for 
example. Analysis of dynamic changes in RNP complex composition in islet cells will lead 
to the identification of novel diagnostic biomarkers and a collection of compelling 
therapeutic targets for modulating insulin production or other gene involved in glucose 
metabolism. However, to confirm that the mRNAs identified are in fact co-regulated with 
proinsulin mRNA, their associations need to be confirmed in vivo, for example, using 
knockout analyses as described in Example 7. 

Example 6: Validation of Functional Role for Genes in the RBP1 Functional Cluster 

To confirm that a gene identified in an mRNP plays a direct role in the etiology of a 
disease or other phenotype being studied, candidate target genes are chosen for inhibition 
studies, e.g., using antisense RNA or RNAi. For example, for each candidate therapeutic gene, 
one or more short DNA segments representing the coding sequence of that gene, is individually 
cloned into a plasmid vector in the sense or antisense direction, downstream of an appropriate 
promoter, such as a U6 polymerase IE promoter or RNAse P RNA HI . Plasmid vectors may be 
constructed that contain two or more short DNA segments of one or more candidate therapeutic 
genes in the .sense and antisense directions, downstream of a U6 polymerase III promoter or 
RNAse P RNA HI. Alternatively, one may construct an RNAi by annealing chemically 
synthesized complementary 22 bp RNAs (Dharmacon, Lafayette, CO). 

Following transfection of the vector or double stranded RNA into cultured cells, 
phenotypic characteristics are evaluated to determine the effect of inhibiting the expression of 
the candidate target gene(s). In addition, to verify inhibition of gene expression at the RNA and 
protein levels, Northern blots, quantitative RT-PCR, Western blot, or other analytical assay of 
time course experiments are performed to demonstrate the efficacy and duration of inhibition for 
the individual genes. 

■ 
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Transections can result in transient expression for one to five days. Alternatively, 
vectors expressing RNAi can be stably expressed in cultured cells by co-transfection and 
selection with a dominant selectable marker, such as neomycin. As alternatives to the use of 
RNAi, traditional antisense DNA or vectors expressing dominant negative forms of targets of 
interest can be used. Antisense and dominant negative genes can be delivered by direct DNA 
transfection or through the use of virus vectors including, but not limited to, retroviruses, 
adenoviruses, ,adeno associated viruses, baculoviruses, poxviruses, and polyomaviruses. The 
biological system of study chosen to demonstrate the role of a gene in disease or cellular 
phenotype is based upon knowledge in the art of the biological system, including a cell culture or 
animal model system, that mimics relevant biological features. 

Knockout of RNA Binding Protein RBP1 Expression 

Although inhibition of RNA binding proteins that bind and regulate multiple mRNAs is 
likely to have a more drastic effect on the phenotype of the cells, this procedure remains an 
important control to verify the critical importance of the mRNP complex. RNAi knock-out 
experiments were used to knock out the expression of the RNA binding protein RBP 1 . The 
results validated the utility of the invention for isolating functionally related groups of genes and 
rapidly identifying and validating novel pathway components relevant to drug discovery. RNAis 
and transfection materials were obtained from Darmachon. Smart pools of rRNAi were made by 
propietary algorithm to design up to 20 RNAis against the gene of interest to increase the 
likelihood of success. RNAis were transfected according to the manufacturer's protocols for 
transfection reagents. Results were verified by QRTPCR or Western blot. RNAi mediated 
knock-out of RBP 1 expression lead to increased basal insulin secretion levels and decreased 
cellular response to glucose. Readout is insulin secretion at basal levels (no glucose stimulation) 
versus insulin secretion upon high glucose stimulation. 

• * 

Knockout of the Expression of Two Ion Channel Protein Associated with RBP1 
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RNAi knock-out of two ion channel proteins resulted in significantly increased levels of 
basal insulin secretion. Knock out of two potassium channels caused extreme increase in basal 
insulin secretion levels, suggesting these channels play a functional role in the process. 

These two potassium channel proteins were not previously implicated in regulating 
rnsulin secretion or pancreatic beta cell function. This is significant, since the action of a class of 
diabetes drugs (sulfonyureas or gliburides like GLUCOVANCE) act by inhibiting a K + channel 
on the pancreatic beta cell. This inhibition leads to membmne depolarization, which allows 
calcium to enter the cell and stimulate release of intracellular secretory granules filled with 
msuhn. Th.se drugs act by increasing overall and basal insulin secretion, thereby controUing 
high glucose levels (hyperglycemia). These results suggest that there are additional K* channels 
that may work in this process and provide candidate targets for new diabetes drugs. 

RNAi and target validation. Initial target validation screens center around the RNA 
interference (RNAi) scheme. In this system INS-1 cells were transfected with siRNA 

smartpools (Dharmacon) 
that are chosen based on 
a proprietary algorithm. 
The smartpools are very 
effective; at least a 50% 
reduction on gene 

expression is detected by 
24-48hours post- 
transfection. As a first 
pass screen, and test of 
the RNAi system in the 
INS-1 cells, three ion 

channels (mnchl , 2, and 3) that have previously not been associated with glucose-stimulated 
insulin secretion (GSIS) were analysed. In addition, RNAi knockdown of PTB was also 
analysed to determine the effect of its decreased expression on GSIS. Finally, a mixed- 
sequence siRNA with no homology to any gene in the current rat database was used as a 
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Fig 5. Effect of PTB and tonCh sIRNAs on GSIS 

250 




Control PTB lonChl lonCh2 lonChS 
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control. Expression of target genes in untreated, control transfected and sequence-specific 
siRNA-transfected cells is assessed by QPCR and/or immunoblotting. Compared to cells 
transfected with the control siRNA, transfection of INS- 1 cells with siRNA to PTB or ionchl 
resulted in significantly elevated basal levels (3mm glucose) of insulin secretion (Fig 5). The 
ability of glucose to stimulate insulin secretion in sirna PTB cells was dimished, and 
although siRNA ionchl cells had an overall reduced fold-stimulation by glucose, there, was a 
significantly elevated level of total secreted insulin. Basal secretion in ionch2 or ionch3 
transfected cells was only slightly elevated and showed normal or slightly elevated glucose 
stimulation. In addition extensive time course experiments and glucose dose response and 
ability to respond to other secretagogues such as sulfonylureas, GLP-1 and fatty acids are 
performed. 

Over-exp ression of Target Proteins 

Alternatively, cells can be transfected with nucleic acids encoding target proteins or 
treated with a transcriptional enhancer for a gene encoding a target protein of interest, in order to 
overexpress a particular target protein identified by the methods of the invention. These systems 
would then be subject to biological assays (e.g., glucose-stimulated insulin secretion), as 
described above. 

In vitro validation 

As indicated above, it has been proposed that PTB binds to a pyrimidine-rich site in 
the V UTR of the proinsulin mRNA in a glucose dependent manner. To validate that PTB 
indeed is responsible, an oligonucleotide capture is used to demonstrate high affinity, 
sequence specific binding to the 3 'UTR element Briefly, biotinylated oligoribonucleotides 
encoding the entire 3' UTR (53nt) or 5' UTR (58nt) of rat preproinsulin, and controls are 
bound to streptavidin-agarose beads and exposed to cell or tissue lysates. Following washing 
of the beads (low to high salt) , selectively bound PTB is detected by elution with SDS 
PAGE sample buffer and immunoblot. Both endogenous PTB from INS-1 cells and 
recombinant GST-PTB (data not shown) can specifically associate with the 3 'UTR and that 
the association is stable to 300 mM NaCl. Using this system, the sequence specificity and 
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the ability of glucose to regulate PTB binding to the preproinsulin mRNA is examined. In 
these experiments cell extracts are prepared from INS-1 cells that have been incubated under 
low (3mM) to high (15mM) glucose. Control and sequence-specific biotinylated oligos are 
incubated with these .extracts, the beads washed and proteins bound to the beads analyzed by 
immunoblotting with a monoclonal antibody to PTB (Zymed). It is expected that if glucose 
can modulate the binding of PTB to the preproinsulin UTR an increase/decrease in PTB 
protein associated with the sequence-specific oligo is detected. 
In vivo validation 

To validate a functional role for PTB and the novel glucose-regulated RBP, a newly 
emergent RNA interference techniques (RNAi) is used to acutely reduce expression of the 
two genes in INS-1 cells. Pools of siRNAs (SmartPools from Dharmacon) designed 
specifically for PTB and the novel RBP gene are transfected into INS-1 cells at various 
concentration (InM -2G0nM) using the RNAi transection reagent TransIT-TKO (Mirus). 
Control siRNAs that do not have sequence homology to any current gene in the rat database 
are included in all experiments. Following transfection, cells are harvested at various time 
points 24-96 hours post transfection to verify reduction in expression of the RNA binding 
proteins at both the RNA and protein levels (antibody permitting). Following optimization of 
conditions for RNA binding protein suppression, cells are challenged with low and high 
glucose to determine affects on glucose stimulated insulin secretion (Elisa, Linco, Inc). It is 
expected that downregulation of PTB which binds to the preproinsulin mRNA should 

\ 

quantitatively or temporally affect insulin biosynthesis and/or secretion. This effect on 
insulin secretion is verified, which suggests that PTB and/or mRNAs associated with the 
cluster play a functional role in GSIS. According to the current model, in addition to PTB, at 
least one additional RBP involved in regulating insulin secretion or proinsulin biosynthesis is 
identified. 

Example 7: Identification of Common Untranslated Elements in mRNAs identified Using 
RAS™ 

Following application of the RAS™ assay, the subpopulation of mRNAs that are present 
in mRNP complexes can be identified and examined for the presence of common UTR sequence 
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elements, computational analysis for homology is not a reliable method for identifying 
Untranslated Sequence Elements for Regulation Codes (USER codes) because they are often 
dependent on secondary structure rather than just primary (sequence) sequence. In addition, the 
subpopulation of mRNAs can be examined for functional relationships. For example, each ' 
mRNA can be categorized by gene annotation and by known functions in functional genomics 
databases (e.g., Locus Link (NCBI, Bethesda, MD), GO Database (Gene Ontology™ 
Consortium), Proteome BioKnowledge® Library (Incyte Genomics, Inc., Palo Alto, CA)). For 
example, if the protein used in the RiboTrap™ assay is involved in immune regulation, the other 
mRNAs found in the same mRNP complex can be analyzed for their role in immune regulation. 
However, the mRNA could be bound indirectly through a different RNA binding protein in the 
mRNP complex (e.g., is assessed to the presence of USER code element in its UTR .that 
recognizes the RNA binding protein or other known binding sites for RNA binding proteins. 

A goal of the RAS™ assay is to identify mRNA populations in which the mRNAs have 
related structural features in their UTRs or the proteins encoded by the mRNAs have functional 
relationships. Among the related functions that are expected are a) involvement of encoded 
proteins in a common metabolic pathway, b) encoded proteins that are temporally co-regulated, 
c) encoded proteins that are similarly localized in or on the cell, d) encoded proteins that play a 
role in forming or regulating a biological machine (e.g., a ribosome). The identification of 
complex traits and phenotypes that result from the expression of a set of functionally-related 
proteins would include such processes as cognition, cell-specific activation, inflammation, or 
differentiation. While proteins known to be involved in these complex processes are known 
from other studies, the majority of the functions remain largely unknown. One of the values of 
the invention is for discovering a larger set of proteins involved in these processes that could 
serve as alternative drug targets or surrogate markers. 

Equivalents 

The invention may be embodied in other specific forms without departing from the spirit 
or essential characteristics thereof. The foregoing embodiments are therefore to be considered in 
all respects illustrative rather than limiting on the invention described herein. Scope of the 
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invention is thus indicated by the appended claims rather than by the foregoing description, and 
all changes that come within the meaning and range of equivalency of the claims are intended to 
be embraced therein. 

* 

* 

Incorporation bv Reference 

All publications and patent documents cited in this application are incorporated by 
reference in their entirety for all purposes to the same extent as if the contents of each individual 
publication or patent document was incorporated herein. 

We claim: 
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Claims 

■ 

■ 

1 . A method for identifying a therapeutic target, the method comprising the steps of 
comparing expression of at least one protein in a first sample to expression of said protein 
in a second sample, wherein said protein is selected from the group consisting of an RNA 
binding protein or an mRNP complex-associated protein, and identifying a differentially 
expressed RNA binding protein or mRNP complex-associated protein as a therapeutic 
target. 

2. The method of claim 1 , further comprising the step of identifying a member selected 

» 

from the group consisting of an mRNA encoding said protein, a gene encoding said protein, an 
mRNP complex comprising said protein, an mRNA associated with said mRNP complex, an 
mRNA associated with said protein, a gene encoding said mRNA associated with said mRNP 
complex, and a gene encoding said mRNA associated,with said RNA binding protein* as a 
second therapeutic target 

3. The method of claim 1 or 2, wherein said first sample is obtained from an individual 
exhibiting a phenotype indicative of disease and wherein said second sample is obtained from an 
individual not exhibiting said phenotype. 

4. The method of claim 3, wherein said phenotype is obesity. 

5. The method of claim 3, wherein said phenotype is diabetes. 

6. The method of claim 3, wherein said phenotype is hypoglycemia. 

7. The method of claim 1 or 2, wherein said first sample comprises differentiated cells and 
said second sample comprises copies of undifferentiated cells. 

■ 

8. . The method of claim 1 or 2, wherein said sample scomprises an adipocyte or a pancreatic 
beta cell. 

9. The method of claim 1 or 2 9 wherein said samples comprises a mature adipocyte. 
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10. The method of claim 1 or 2, wherein said samples comprise a preadipocyte. 

1 1 . The method of claim 1 or 2, wherein said samples comprise a hepatocyte. 

* 

12* The method of claim 1 or 2, wherein said RNA binding protein is a polypyrimidine tract 
binding (PTB) protein. 

1 3. The method of claim 1 or 2, wherein said RNA binding protein is a selected from the 
group consisting of ribonuclease P (30kD), nuclear receptor coactivator 5, DEAD/H 
(Asp-Glu-Ala- Asp/His) box polypeptide 28, RNA binding motif protein (Y chromosome, 
Al), LOC136197, exosome component Rrp46, zinc finger protein 85 (HPF4, HTF1), 
LOCI 34477, pterin-4 alpha-carbinolamine dehydratase, nuclear receptor co-repressor 2, 
IGF-II mRNA-binding protein 1, putative 28kDa protein, gonadotropin-reguiated 
testicular RNA helicase, LOC56902 (hypothetical protein XPJ)95071), primase 
(polypeptide 2A (58kD)), bol (boule-like, Drosophila), LOC151613, small nuclear 
ribonucleoprotein polypeptide G, nuclear factor of kappa light polypeptide gene enhancer 
in B-cells 2 (p49/pl00), heterogeneous nuclear ribonucleoprotein D, LOC163412 
(hypothetical protein XP_088868), ectonucleotideprophosphatase/phosphodiesterase 2, 
ribosomal protein S15a, LOC126205, huntingtin-interacting protein HYPA/FBP1 1, 
LOCI 38280 (similar to KIAA1 138 protein), LOC165271 (similar to hypothetical protein 
DKFZp434Bl727 from H. sapiens), ribonuclease (RNase A family, 2), human 
epididymis-specific 3 beta? LOC129715 (similar to tudor protein), ribosomal protein S21, 
2\5'-oligoadenylate synthetase 3, ring finger protein 17 isoform short, LOC127164 
(similar to TAR DNA binding protein), and 2 , ,5 , -oligoadenylate synthetase 1 (isoform 
E16). ' 

14. The method of claim 1 or 2, wherein said first sample is treated with an agent selected 1 
from the group consisting of insulin, insulin-like growth factor- 1 (IGF-1), p-adrenergic agonist, 
glucose, glucagon-like peptide-1 (GLP-1), fatty acid and thiazolidinediones and wherein said 
second sample is not treated with said agent. 
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15. A method for identifying a gene or gene product involved in a biochemical pathway in a 
cell, the method comprising the steps of: 

a. identifying at least one protein in a cell, wherein said protein is selected from the 
group consisting of an RNA binding protein and an mRNP complex-associated protein that 
differentially expressed in a first sample and a second sample, and wherein said differential 
expression is indicative that said protein is a therapeutic target; 



is 



b. isolating an mRNP protein complex comprising said protein and at least 
mRNA; and 



one 



c. identifying a component of said complex as being involved in a biochemical 
pathway. 

1 6. The method of claim 1 or 2, wherein said mRNA is selected from the group consisting of 
the genes listed in Figure 16. 

17. The method of claim 15, further comprising the step of contacting said first sample with 
an agent that interacts with or regulates a component of said biochemical pathway. 

18. The method of claim 15, wherein said agent inhibits said biochemical pathway. 

19. The method of claim 15, wherein said agent enhances said biochemical pathway. 

20. The method of claim 15, wherein said biochemical pathway is an insulin production 
pathway. 



21. 



22. 



The method of claim 1 5, wherein said biochemical pathway is a lipogenesis pathway. 



The method of claim 15, wherein said protein is a polypyrimidine tract binding (PTB) 

protein. 
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The method of claim 15, wherein said sample comprises preadipocytes, adipocytes or 
pancreatic beta cells. 
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24. A method for identifying an agent that interacts with or regulates a component of a N 
biochemical pathway, the method comprising the steps of: 

■ 

a. contacting a sample comprising a biochemical pathway with an agent; 

b. preparing a ribonomic profile of said agent-treated sample, wherein said 
ribonomic profile comprises expression of at least one component of said biochemical pathway 
associated with at least one mRNP complex, said component selected from the group consisting 
of an RNA binding protein, an mRNA, and an mRNP complex-associated protein; and 

c. comparing the level of expression of said component in said agent-treated sample 
to the level of expression of said component in a control sample, wherein a difference in said 
expression of said component is indicative that said agent is capable of interacting with or 
regulating said component. 

25. A method for assessing the efficacy of an agent as a therapeutic in treating an individual 
having a disease associated with altered glucose metabolism, the method comprising the steps of: 

a. contacting a sample from an individual having a disease with an agent; 

p ♦ 

b. preparing a ribonomic profile of said agent-treated sample, wherein said 
ribonomic profile comprises expression of at least one gene product associated with at least onje 
mRNP complex, said gene product selected from the group consisting of an RNA binding 
protein, an mRNA, and an mRNP complex-associated protein; and 

c. comparing the level of expression of said gene product in said agent-treated 
sample to the level of expression of said gene product in a control sample, wherein a difference 
in expression of said gene product is indicative that said agent is a therapeutic capable of treating 
said disease. > 

26. The method of claim 25, wherein said disease is diabetes. 



27. The method of claim 25, wherein said disease is obesity. 
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28. The method of claim 25, wherein said disease is hypoglycemia. 

29. The method of claim 25, wherein said sample comprises an adipocyte. 

30. The method of claim 25, wherein said sample comprises a pancreatic beta cell. 

■ 

31. The method of claim 25, wherein said cell is a hepatocyte. 

32. A method for identifying a protein that regulates glucose metabolism, the method 

i 

comprising the steps of: 

a. measuring said expression of at least one gene product of a pancreatic beta cell 
sample, wherein said gene product is selected from the group consisting of an RNA binding 
protein, an mRNA associated with said RNA binding protein, or an mRNP complex-associated 
protein; 

b. treating said pancreatic beta-cell sample with glucose, glucagon-like peptide-1 
(GLP-1), fatty acid or a thiazolidinedione; and 

* 

c. measuring expression of said protein after treatment, wherein a difference in 
expression of said protein after treatment compared to expression of said protein before 
treatment is indicative that said protein regulates glucose metabolism. 

33. A method of identifying gene products co-regulated with preproinsulin mRNA, the 
method comprising the steps of: 

* 

a. isolating an mRNP complex comprising a protein identified by the method of 
claim 1 or 2, and at least one mRNA or at least one mRNP complex-associated protein; and, 

t 

b. identifying said mRNA or said mRNP complex-associated protein, wherein said 
mRNA or said mRNP complex-associated protein are co-regulated with preproinsulin mRNA. 

p 
* 

34. A method for identifying a protein that regulates insulin production, the method 
comprising the steps of: 
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a. contacting a pancreatic beta cell sample with at least one probe, wherein said 
probe is capable of binding to at least one protein, wherein said protein is capable of binding to 
the 3' UTR or 5/ UTR of a preproinsulin mRNA; 



b. separating said probe from said protein; and 

♦ 

c. identifying said protein. 

35. The method of claim 34, wherein protein binds to a probe comprising a sequence selected 
from the group consisting of S'-gaauaaaaccuuugaaagagcacuac-S 1 , and 5- 
cccaccacuacccuguccaccccucugcaaug-3', and 5'- 
agccctaagtgaccagctacagtcggaaaccatcagcaagcaggtcattgttccaac-3 ' . 

36. An isolated composition of matter identified by the method of claim 24. 

37. An isolated mRNA protein complex identified by the method of claim 1 5. 

* 

38. An mRNP complex-associated with diabetes, wherein said mRNP complex comprises a 
polypyrimidine tract binding (PTB) protein, and at least one mRNA associated with said 
polypyrimidine tract binding protein. 

38. The method of claim 1 5, wherein said biochemical pathway is a glucose-uptake pathway. 



\ 



* 
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Abstract 

the identification and evaluation ofmRNA and protein targets associated with RNA 
binding proteins or mRNP complexes associated with glucose metabolism is described. In 
particular, the invention provides methods for identifying RNA binding proteins associated with 
glucose metabolism biochemical pathways and mRNAs that exhibit coordinated gene regulation 
across those pathways. Candidate targets are provided that are useful for treating diseases 
related to aberrant glucose metabolism, such as, for example, obesity and diabetes. 
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